Mapping Political Communities: A Statistical Analysis of...
Transcript of Mapping Political Communities: A Statistical Analysis of...
Mapping Political Communities: A Statistical Analysis
of Lobbying Networks in Legislative Politics∗
In Song Kim† Dmitriy Kunisky‡
First Draft: August 25, 2017This Draft: June 13, 2018
Abstract
Political connections between special interest groups and politicians play a significant rolein policymaking. Yet, empirical studies of interest group politics and representation have beenfundamentally limited by the difficulty of observing these ties directly. We bridge the two withan original dataset of two political behaviors around congressional bills: (1) sponsorship bypoliticians, and (2) lobbying by interest groups. We then develop a methodological frameworkto examine the political networks these data describe. Unlike networks in electoral politics,whose structure has been found to reflect ideology, we find distinct political communities inthe lobbying network, organized according to industry interests and jurisdictional committeememberships. Furthermore, we observe important actors having mixed memberships in multiplecommunities, capturing their simultaneous commitments to diverse political interests. Ourfindings provide evidence for the existence of powerful networks in U.S. legislative politics,quite distinct from electoral networks and exhibiting numerous unique structural features.
Keywords: Network analysis, lobbying, ideal point estimation, scaling, stochastic block model,link community model, community detection.
Word Count: 12,029 (abstract: 149)
∗We thank Pablo Barbera, J. Lawrence Broz, Devin Caughey, Marina Duque, Nolan McCarty, Cristopher Moore,Michael Peress, Yunkyu Sohn, Erik Voeten, and Hye Young You for helpful comments. We also thank seminarparticipants at the American Political Science Association annual meeting, Asian Political Methodology annualmeeting, Princeton University, University of California, San Diego, and University of Maryland. Kim acknowledgesfinancial support from the National Science Foundation (SES-1264090 and SES-1725235).†Assistant Professor, Department of Political Science, Massachusetts Institute of Technology, Cambridge, MA,
02139. Email: [email protected], URL: http://web.mit.edu/insong/www/‡Ph.D. Student, Department of Mathematics, Courant Institute of Mathematical Sciences, New York University,
New York, NY, 10012. Email: [email protected].
1 Introduction
Special interest groups engage in lobbying to promote their political objectives (e.g., Wright, 1996;
Grossman and Helpman, 2001).1 A dominant view among political scientists holds that interest
groups take part in such costly political activity to retain access to policymakers, and in return
policymakers gain an effective means of informing their legislative decisions (Bauer, Dexter, and
Poll, 1972; Potters and Van Winden, 1992; Austen-Smith and Wright, 1992; Wright, 1996). Others
consider whether policymakers might be compensated more directly through the related mechanism
of campaign contribution (Austen-Smith, 1995; Ansolabehere, Snyder, and Tripathi, 2002). Yet
another theory is that lobbying serves instead as a “legislative subsidy,” through which interest
groups help allied politicians pursue common objectives (Hall and Deardorff, 2006). In any case,
despite the significance of connections between interest groups and politicians for both politics and
economics (Khwaja and Mian, 2005; Faccio, Masulis, and McConnell, 2006; Faccio, 2006; Kang and
You, 2017), empirical studies of legislative politics have been limited by the difficulty of directly
observing these political ties, let alone the actual policy context in which they are formed.
In this paper, we identify a type of political connection that can be observed tractably between
every Member of the U.S. Congress and every actively lobbying special interest group. Our first
contribution is to construct a large political network dataset tracing the universe of lobbying
activities that are directly related to 108,086 congressional bills introduced between the 106th and
the 114th Congress. We then consider for further analysis a subset of these data joining two specific
types of political behavior around each bill: (1) sponsorship by a politician, and (2) lobbying by
interest groups. Although lobbying on a single bill does not necessarily imply political ties to its
sponsor, recurring instances of lobbying that involve the same interest group and sponsor across
several bills do reliably encode close political relationships.2 Therefore, analyzing how often various
politicians and interest groups coincide on individual bills lets us infer the structure of political
networks underlying the legislative process.
The left panel of Figure 1 shows that about 12,000 bills are introduced in each Congress and the
majority of them are lobbied by at least one interest group, while very few are eventually voted on
the floor. Note that the remaining bills do not necessarily “die.” Rather, they tend to be merged
into larger bills that are eventually voted on the floor through a complex process of amendment
and compromise. Our dataset allows us to observe political connections before this process takes
place, and therefore at a finer level of granularity. Indeed, as the right panel shows, most bills are
1We use the term special interest groups to refer to any political actors who have particular policy objectives,including firms, trade associations, labor unions, business associations, and professional associations.
2There is ample empirical evidence that lobbyists help to draft or even write bills on behalf of legislators withwhom they have political connections (Nourse and Schacter, 2002; Hertel-Fernandez, 2014).
1
106 108 110 112 114Congress
0
2000
4000
6000
8000
10000
12000
14000Nu
mbe
r of B
ills Bills Lobbied
Bills Introduced
Bills Voted
0 50 100 150 200+Number of Unique Interest Groups Lobbying per Bill
0
500
1000
1500
2000
2500
3000
Num
ber o
f Bills
in 1
13th
Con
gres
s
Figure 1: Descriptive Statistics of Lobbying on Congressional Bills. The left panel com-pares the numbers of bills introduced, lobbied, and voted on between the 106th and the 114thCongress. On average, about 12,000 bills are introduced in each Congress, only about 600 areeventually voted on the floor, but the majority are lobbied by at least one interest group. Theright panel shows the distribution of the number of interest groups lobbying on each bill in the113th Congress. The distribution is highly skewed: the median is three, the maximum is 978 and25% of the lobbied bills are lobbied by only one interest group. Many more bills are identified aslobbied beginning in the 110th Congress, when lobbying reports became digitized per the HonestLeadership and Open Government Act of 2007.
lobbied by very few interest groups, implying that each individual instance of lobbying tends to
reflect narrow interests. A typical example: on “A bill to exempt the aging process of distilled
spirits from the production period for purposes of capitalization of interest costs” (113th S. 1457),
sponsored by Senator Mitch McConnell (R-KY), the Distilled Spirits Council of the U.S.
was the only interest group to lobby.3
Our second contribution is to develop statistical network models to examine the underlying
factors that might drive political connections. We begin by introducing a latent space model, an
extension of the widely used item response theory (IRT) model, to estimate a latent “ideal point”
or “preferred policy” for each political actor. This approach has been widely used to study various
legislative bodies, including state legislatures (Shor and McCarty, 2011), the U.S. Congress (Fowler,
2006), and the United Nations (Bailey, Strezhnev, and Voeten, 2017). Unlike existing studies that
rely on roll calls for the few bills that are voted on the floor (Poole and Rosenthal, 2011; Clinton,
Jackman, and Rivers, 2004), we analyze all introduced bills to infer underlying policy preferences,
as reflected at the point of sponsorship.
Our model locates legislators and interest groups in a common “marketplace,” in which prox-
imity implies a closer alignment of unobservable interests. We find that, in this marketplace,
there is clustering of interest groups and legislators according to their industry affiliations and
3See Kim (2017) for examples of similar patterns in trade bills.
2
memberships in committees with jurisdiction over those industries, respectively. This is different
from findings based on similar models in other political contexts, such as campaign contributions
and social media following (Bonica, 2013; Barbera, 2014; Bond and Messing, 2015), where the
revealed underlying preferences tend to align strongly with existing measures of ideology, such as
DW-NOMINATE scores (Poole and Rosenthal, 2011). Instead, our finding is consistent with the
thesis of Ansolabehere, Snyder, and Tripathi (2002), who argue that political actors who act mainly
through lobbying are more bipartisan and less ideological than those who act mainly through cam-
paign contributions. It also attests to the influence committees have over policy outcomes through
intervention at several stages of the legislative process, as described by Shepsle and Weingast
(1987). In short, interest groups target influentially-positioned legislators in interactions localized
to specific policy domains, forming communities in the lobbying network.
We then propose a new methodological framework tha incorporates the community structure we
identify in the spatial model. We begin by applying the stochastic block model, a sociological tool
(Snijders and Nowicki, 1997) more recently adopted in physics and machine learning (Fortunato,
2010). We verify that the communities identified by the stochastic block model largely agree
with those identified ex post in the latent space model, confirming that the lobbying network
exhibits significant community structure. However, we also observe an important and qualitatively
different behavior in the lobbying network, namely that many interest groups and politicians tend
to represent mixed interests across several political communities.
To study the effects of this, we develop a new statistical model that allows actors to belong to
multiple political communities (so-called “mixed membership”). The proposed model finds that
interest groups such as the Chamber of Commerce that represent the heterogeneous interests of
many members engage in aggregate lobbying, highly mixed among many communities, often target-
ing politicians who are members of broad “procedural” committees such as the House Committee
on Ways and Means and the Senate Committee on Appropriations. This is consistent with
Wright (1990) who, based on numerous personal interviews and a survey of lobbyists, found the
prevalence of lobbying by powerful interest groups to affect politicians’ voting especially strongly
within the House Committee on Ways and Means. Our model also captures the intermedi-
ate lobbying behavior of firms involved in a few industries at once, and the focused lobbying of
industry-specific organizations observed previously. In this regard, the proposed methodology im-
proves our understanding of the high-level organization of lobbying in the U.S. Congress, accurately
capturing the full spectrum of behaviors from very targeted to very broad lobbying.
Taken together, our results illustrate a nuanced community structure in the lobbying network,
remarkably different from most political networks studied previously. We find evidence undermining
3
the broad claim that “lobbying is specialized,” which plays a key role in the argument of Hall and
Deardorff (2006), as our analysis shows that mixed interests and the spectrum between specialist
and generalist interest groups are in fact crucial distinctions for describing lobbying activity.4 The
lobbying network, we find, must be understood as a collection of interlocking topical “arenas” in
which lobbying on particular issues occurs, and both the individual arenas and the mechanism of
mixed interests through which they interact are important in generating the diversity of behavior
observed in the lobbying network. More broadly, our results call for the mechanisms through
which committee power is exercised to express policy preferences in legislative politics to be more
carefully considered, as has been previously explored by Shepsle and Weingast (1987); Schickler
(2001); Sinclair (2016); and others. In particular, these mechanisms cannot be understood merely
by analogy with public-facing politics: our results show that the interactions of legislators with
interest groups are quite different from their position-taking interactions with the public at large
as observed through roll call votes and the like, the latter being driven primarily by electoral
motivations (Mayhew, 1974).
We also believe that our network analyses will provide useful technical guidance for further
research in this direction. To the best of our knowledge, ours is the first statistical study of
lobbying networks in legislative politics in their entirety, including both politicians and interest
groups. The proposed community detection approach introduces a novel empirical framework for
applied political scientists to systematically identify political communities and characterize their
memberships, which will help leverage the rich network data that have become increasingly available
in various subfields of political science (Keck and Sikkink, 1998; Hoff, Raftery, and Handcock, 2002;
Maoz et al., 2006; Hafner-Burton, Kahler, and Montgomery, 2009; Clark and Lauderdale, 2010;
Ward, Stovel, and Sacks, 2011).
The rest of the paper is organized as follows. Section 2 introduces a database of lobbying and
a network dataset obtained from it. In Section 3, we introduce a Bayesian Latent Space Network
Model (LSNM) and apply it to lobbying data from the 113th Congress. We then introduce the
community detection framework and develop the Bipartite Link Community Model (biLCM) in
Section 4. Lastly, Section 5 gives some concluding remarks. Open-source software implementing
the proposed methods will be made available as a package for the R and Python languages. The
network data, all estimated pairwise measurements of political connections between politicians and
interest groups, the estimated spatial preferred policy locations and their posterior distributions,
and the visualization tools used in preparing this paper will also be made publicly available.
4We do not address in this work the more intricate question of whether lobbyists, rather than interest groups, aretypically specialists or generalists, though we believe the lobbying network dataset will allow future work to conductsuch analyses.
4
2 The Lobbying Network Database
The Lobbying Disclosure Act of 1995 (amended by the Honest Leadership and Open Government
Act of 2007) requires mandatory quarterly electronic filing for any organization (“client”) that
“actively participates” in lobbying. Filings must disclose the general lobbying issue area, the con-
gressional and federal agencies contacted, and the specific issues on which lobbyists (“registrants”)
have engaged in political activities.5 Based on the information available from this disclosure of
political activities, researchers have studied various aspects of the politics of lobbying, including
the sources of and returns on decisions to lobby (Richter, Samphantharak, and Timmons, 2009;
De Figueiredo and Richter, 2014; Kang, 2015; You, 2017), the similarities and differences between
campaign contribution and lobbying (Ansolabehere, Snyder, and Tripathi, 2002), characteristics of
lobbyists (Baumgartner et al., 2009; Vidal, Draca, and Fons-Rosen, 2012; Bertrand, Bombardini,
and Trebbi, 2014), and the implications of lobbying for trade politics (Bombardini and Trebbi,
2012; Kim, 2017). However, the available lobbying data has the important limitation that interest
groups are not required to report the identities of their political contacts. This is unfortunate, given
that almost 90% of lobbying reports indicate that at least one Member of Congress or a member
of their staff was indeed contacted. In order to study the connections between interest groups and
politicians, there is therefore no choice but to infer those unobserved connections statistically from
the available data.
To that end, we construct an original database describing the lobbying network, which consists
of all of the links described above among lobbyists, interest groups, bills, and politicians. The
database is generated from lobbying reports filed between 1999 and 2017, which are available
from the Senate Office of Public Records (SOPR). A key component of the database is a suite of
automated systems to (1) detect congressional bills reported to have been lobbied, (2) identify the
session of Congress those bills occurred in, and (3) relate each bill to its sponsor. Below, we give
a brief outline of the process used to construct the lobbying database.
First, we identify all lobbying activities related to legislative bills, which is possible due to 2
U.S.C. §1604(b)(2)(A) legally requiring interest groups to disclose the “list of bill numbers” lob-
bied.6 For example, a report filed by Intel Corporation mentions “H.R. 289, National STEM
Education Tax Incentive or Teachers Act of 2011—Science and math education legislation.” In
practice, bills are often referenced within the text of a lobbying report, and sometimes only by
alternative names or subtitles, so the entire report text must be processed and all bill references
5See https://lobbyingdisclosure.house.gov/amended_lda_guide.html for the definition of lobbying.6Compliance with disclosure requirements is closely monitored and enforced. It is annually audited by the Govern-
ment Accountability Office (GAO). According to the 2014 audit report by GAO, 90% of organizations filed reports asrequired, and 93% could provide documentation related to expenses. The 2014 GAO report on lobbyists’ compliancewith disclosure requirements is available from http://www.gao.gov/products/GAO-15-310.
5
algorithmically identified. Second, we identify the session of Congress that each bill belongs to,
which often is not mentioned explicitly, as in the above example. We use text data mining tech-
niques to determine whether information such as the bill title or phrases like “Act of [YEAR]”
occur in the report text. Based on our algorithm, described in greater detail in Appendix A.1,
we correctly determine that, for instance, the above bill H.R. 289 is from the 112th Congress.
If the year that the lobbying report was filed had been used to determine the Congress instead,
then this bill would have been identified as H.R. 289 “Value Our Time Elections Act” from the
113th Congress (since the report was filed in 2013).7 Finally, we identify the sponsor of each bill,
completing the connection between lobbying clients and politicians. We repeat this process over
each of 1,111,859 lobbying reports, which in total link 20,092 special interest groups with 1,164
(current and former) Members of Congress.
Figure A.3 in the Appendix shows that lobbying and sponsorship tend to be heavily skewed
in their frequency, in that most politicians and interest groups have very few interactions, while a
very small number are highly active. The incidences of politicians and interest groups on bills are
similarly skewed, and are moreover very sparse, most pairs of actors not coinciding on any bills at
all, and a few coinciding on many. For instance, 98.28% of pairs of actors in the 113th Congress do
not coincide on any bills, while the pair with the greatest number of bills lobbied and sponsored
in common during the 113th Congress consists of Senator Bernard Sanders (D-VT) and the Iraq
and Afghanistan Veterans of America, having 25 common bills; we will later see that a
group of veterans’ associations in fact forms one of the significant clusters in our models, which
ultimately belongs to a larger but subtler cluster related to issues of civil society. More details on
our datasets are given in Appendix A.2.
Notations Before describing our models, we introduce some mathematical notation and short-
hand terminology that will be useful in the sequel. We index a set of interest groups (we will
sometimes use the term “lobbying clients” or simply “clients”, this being the terminology of the
Lobbying Disclosure Act, as well as the more colloquial “firms”) by [m] = {1, . . . ,m} and a set
of legislators by [n]. When a bill is reported as lobbied on by client i and sponsored by politician
j, we say that the client and politician coincide on that bill (coinciding pairs will also sometimes
be called “partners”), and each lobbying report client i writes that mentions the bill is a political
incidence. We denote the number of incidences (over some period of time, usually a single ses-
sion of Congress) between the pair i, j by Ai,j , and organize these numbers as the entries of the
incidence matrix A ∈ Rm×n. This matrix may be viewed as the adjacency matrix of a bipartite
7In fact, the OpenSecrets.org database from the Center for Responsive Politics, which is often used in academic re-search, erroneously finds that Intel lobbied on the latter. See http://www.opensecrets.org/lobby/billsum.php?id=hr289-113.
6
graph G with weighted edges, where the politicians and interest groups lie on opposite sides of
the partition. The degree (a term from network theory) of a client or politician is the sum of the
incidences between them and every actor of the other type. Therefore, the degree of a politician
is the total number of times bills they have sponsored have been lobbied on, and the degree of an
interest group is the total number of times they have lobbied on individual bills.
3 Latent Space Modeling of the Lobbying Network
In this section, we develop a Bayesian Latent Space Network Model (LSNM). Following the in-
troduction of this style of model by Hoff, Raftery, and Handcock (2002), there has been a great
proliferation of variants, of which we mention just a few that are especially useful for understanding
our methods. Our model is formally similar to the “Wordfish” model of Slapin and Proksch (2008)
for text analysis, but while we estimate the underlying preference structure of two distinct groups
of interacting political actors, Slapin and Proksch (2008) estimate ideal points of a single group of
political agents based on text data. Our model is also closely related to that of Barbera (2014),
which relates ideal points of politicians and social media users, but instead of modeling whether
a binary political interaction exists (e.g., Twitter follows), we consider how frequently political
contacts occur, our interaction measurements therefore having an associated magnitude.
The Model We model the connection between client i and legislator j as a function of the two
actors’ latent preferred policy positions in a d-dimensional Euclidean space, θi ∈ Rd and ψj ∈ Rd
respectively, and assume that the frequency of their interactions Ai,j has a Poisson distribution,
with its mean parametrized by the Euclidean distance ‖θi −ψj‖.To account for the differences in agents’ baseline propensities to sponsor or lobby (see Fig-
ure A.3), we also include client- and legislator-specific “popularity” terms, αi and βj respectively,
in the mean of the modeled Ai,j .8 We then take the probabilistic model
Ai,j ∼ Poisson(exp(αi + βj − ‖θi −ψj‖)) (1)
for the entries, which we model as independent conditional on the parameters, to obtain the
following joint distribution:
P (A | α,β,θ,ψ) =m∏i=1
n∏j=1
Poisson (Ai,j | exp (αi + βj − ‖θi −ψj‖)) . (2)
We take a further hierarchical extension of this model where the latent space positions of clients
and politicians are distributed with multivariate normal population distributions, N (0, diag(τ ))
8These variables in the literature are variously called “popularity”, “gregariousness”, or “idiosyncratic” factors(we adopt the first term). A similar construction to ours was tested on synthetic datasets by Krivitsky et al. (2009).
7
and N (µ,Σ) respectively, where the distribution of client positions is centered and assumed to have
a diagonal covariance matrix to account for translational and rotational invariances of the latent
space positions. Likewise, popularity factors are assumed to be drawn from normal population
distributions, N (ν, σ2(L)) andN (0, σ2(P )), where the latter is centered to account for the translational
invariance of the popularity factors. The posterior distribution under this model is then:
P (α,β,θ,ψ | A) ∝m∏i=1
n∏j=1
Poisson (Ai,j | exp (αi + βj − ‖θi −ψj‖))×
m∏i=1
N (θi | 0, diag(τ ))×n∏j=1
N (ψj | µ,Σ)×
m∏i=1
N (αi | ν, σ2(L))×n∏j=1
N (βj | 0, σ2(P )). (3)
Identifiability The hierarchical priors we use resolve identifiability issues due to the invariance
of θi and ψj under rotations and translations, and the invariance of α and β under translations.
The remaining identifiability issue is the invariance of θi and ψj under reflections, which makes
the posterior distribution multimodal. This issue becomes more significant as the latent space
dimension increases: in dimension d, one expects the posterior to have 2d modes, one for each
choice of which axes to reflect across. We will only consider d ∈ {1, 2}, and with d = 2 we find that
this issue can still be resolved by the same method as is usual with d = 1 in ideal point estimation,
namely by fixing the position of one agent whose position we believe to be far from the origin of
the latent space. In practice, even fixing the starting position of one important agent to lie in a
specified orthant appears to suffice to ensure that MCMC sampling will only explore one mode of
the posterior. As we will see, the agents on the periphery of the latent space tend to belong to
particular industries. Thus, we choose one significant client from such an industry (we have found
large energy holdings companies, which together form the “core” of the very large energy industry
cluster, to work well for this purpose) and fix its starting position in a particular orthant of Rd.
For a thorough discussion of this problem when d = 1, see Bafumi et al. (2005). For some technical
discussion of an unusual identifiability issue we observed when replacing ‖θi − ψj‖ in our model
with ‖θi −ψj‖2, see Appendix A.3.4.
Computation We perform inference for our model with the Hamiltonian Monte Carlo No-U-
Turn Sampler implementation in the Stan software package (Carpenter et al., 2017). In the results
presented here, we use estimated posterior means of our parameters taken as point estimates, and
perform further analysis of these point estimates so as to focus on politically meaningful findings.
We give further details, a more technical evaluation of our implementation and of concentration of
the posterior distribution, and the Stan code describing our model in Appendix A.3.
8
One-Dimensional LSNM Two-Dimensional LSNM
2 0 2 41D Latent Space Position: Politicians
1.00
0.75
0.50
0.25
0.00
0.25
0.50
0.75
1.00
DW-N
OMIN
ATE
Dim
ensio
n 1
( = 0.13)
4 2 0 2 42D Latent Space Dimension 1: Politicians
( = 0.12)
5.0 2.5 0.0 2.5 5.02D Latent Space Dimension 2: Politicians
( = 0.19)
Figure 2: DW-NOMINATE Ideology vs. LSNM. We illustrate the lack of correlation betweenpoliticians’ latent space positions obtained from one- and two-dimensional LSNM estimates and theDW-NOMINATE ideology dimension. It shows that the ideological structures common in electoralpolitics are not prevalent in the lobbying network. Points are colored on a linear scale from blueto red according to the DW-NOMINATE dimension as well, to introduce a convention we will usethroughout the paper.
Empirical Findings We apply the proposed model to the lobbying network dataset for the 113th
Congress. We first consider a one-dimensional LSNM, the simplest model our framework admits.
To compare with the situation in electoral politics, we examine how estimated latent preferred
policy positions of politicians (ψj ∈ R) correlate with the DW-NOMINATE ideology measures.
As the left panel of Figure 2 shows, we do not find evidence that ideological differences drive
the interaction between interest groups’ lobbying and politicians’ legislative activities (Pearson’s
ρ = 0.13 between our latent dimension and the DW-NOMINATE ideology dimension). This lack
of correlation holds for restrictions to only Democrat and Republican Members of Congress, as
well. Our finding is consistent even when we increase the dimensionality of the latent positions.
Specifically, the right panel shows that each of the estimated latent positions are weakly correlated
with the ideology score.
Increasing the dimensionality of the LSNM, however, facilitates the interpretation of the esti-
mates.9 For the two-dimensional LSNM, Figure 3 presents the posterior means of the estimated
latent spatial positions, θi and ψj , for all clients i and politicians j.10 Each client and each politi-
cian is represented by a circle, the clients’ circles colored black, and the politicians’ circles colored
according to their DW-NOMINATE ideology dimension. The size of each circle is proportional to
the exponential of their popularity factor (αi and βj), so that the mean of the Poisson interaction
between two agents at fixed distance is proportional to the product of their circles’ sizes.
9For more technical analysis of choice of dimensionality for the LSNM based on heuristics from spectral methods,the reader may consult Appendix A.3.1.
10We analyze the posterior distributions further in Appendix A.3, with particular attention to the uncertainties ofthe point estimates given by the posterior means to illustrate the stability of these results.
9
43
21
01
23
4La
tent
Spa
ce D
imen
sion
1
4321012345
Latent Space Dimension 2
Polit
icia
ns (D
emoc
rat)
Polit
icia
ns (R
epub
lican
)Lo
bbyi
ng C
lient
s
Goo
gle
Yaho
o!Am
azon
Mas
terc
ard
Dell
Texa
s In
stru
men
tsO
racle
Hewl
ett P
acka
rd
Darre
ll Iss
a (R
-CA)
Jaso
n Ch
affe
tz (R
-UT)
John
Cor
nyn
(R-T
X)O
rrin
Hatc
h (R
-UT)
Patri
ck L
eahy
(D-V
T)Zo
e Lo
fgre
n (D
-CA)
Char
les
Schu
mer
(D-N
Y)
Technology
Ntl. A
ssn.
of B
road
cast
ers
Cinc
inna
tti B
ell
AT&T
Cor
pora
tion
Tim
e W
arne
r Cab
leVe
rizon
Gov
. Rel
atio
nsTw
enty
-Firs
t Cen
tury
Fox
Bob
Goo
dlat
te (R
-VA)
Mike
Rog
ers
(R-M
I)M
ichae
l McC
aul (
R-TX
)Jo
hn R
ocke
felle
r (D-
WV)
Ron
Wyd
en (D
-OR)
Doris
Mat
sui (
D-CA
)
Telecom
Earth
just
ice L
egal
Def
ense
Fund
Blue
Gre
en A
llianc
eSi
erra
Clu
bTh
e W
ilder
ness
Soc
iety
Safa
ri Cl
ub In
tern
atio
nal
Amer
ican
Chem
istry
Coun
cil
Rand
Pau
l (R-
KY)
David
Vitt
er (R
-LA)
Paul
Gos
sar (
R-AZ
)Ba
rbar
a Bo
xer (
D-CA
)To
m U
dall (
D-NM
)Ro
bert
Case
y (D
-PA)
Environm
entalism
Berk
shire
Hat
hawa
y En
ergy
Koch
Co.
Pub
lic S
ecto
rNa
tiona
l Grid
USA
Rene
wabl
e Fu
els A
ssn.
Allia
nce
of A
utom
obile
Mnf
.Ar
ch C
oal
BP A
mer
icaCh
evro
n US
A
Pom
peo,
Mike
(R-T
X)Fl
ake,
Jef
f (R-
AZ)
McK
inle
y, Da
vid (R
-WV)
Barra
sso,
Joh
n (R
-WY)
Edwa
rd M
arke
y (D
-MA)
Shah
een,
Jea
nne
(D-N
H)
Energy
US T
rave
l Ass
n.M
arrio
tt In
tern
atio
nal
Glob
al B
usin
ess
Trav
elAs
sn.
Mor
phot
rust
USA
Jose
ph H
eck
(R-N
V)Ca
ndic
e M
iller
(R-M
I)J.A
. Em
erso
n (R
-MO
)Tr
ey G
owdy
(R-S
C)M
ike
Qui
gley
(D-I
L)
Tra
vel
Mut
ual o
f Om
aha
Aflac
Cor
pora
tion
Nat
ionw
ide
Mut
ual
Alls
tate
Insu
ranc
eAm
eric
an F
amily
Mut
ual
Mic
hael
Grim
m (R
-NY)
R. N
euge
baue
r (R-
TX)
Bill
Hui
zeng
a (R
-MI)
M. C
apua
no (D
-MA)
R. M
enen
dez
(D-N
J)
Insu
ranc
e
Capi
tal O
ne F
inan
cial
Amer
ican
Ban
kers
Ass
n.Ba
nk o
f Am
eric
aSe
curit
ies
Indu
stry
and
Fin.
Mar
kets
Ass
n.
Scot
t Gar
rett
(R-N
J)Bo
b Co
rker
(R-T
N)
Gus
Bilir
akis
(R-F
L)M
axin
e W
ater
s (D
-CA)
Jani
ce H
ahn
(D-C
A)
Fina
nce
Nor
thw
este
rn U
nive
rsity
Vand
erbi
lt Un
iver
sity
Univ
ersi
ty o
f Virg
inia
Mas
sach
uset
ts In
stitu
teof
Tec
hnol
ogy
McG
raw
Hill
Edu
catio
n
Virg
inia
Fox
x (R
-NC)
Luke
Mes
ser (
R-IN
)Ch
ris C
ollin
s (R
-NY)
Spen
cer B
achu
s (R
-AL)
Joe
Man
chin
(D-W
V)H
enry
John
son
(D-G
A)
Uni
vers
itie
s
Amer
ican
Col
lege
of
Emer
genc
y Ph
ysic
ians
AARP
Hea
lthca
re L
eade
rshi
pCo
unci
lAs
sn. o
f Am
eric
anM
edic
al C
olle
ges
Amer
ican
Ass
n. o
f Nur
sePr
actit
ione
rs
Mic
hael
Bur
gess
(R-T
X)D
iane
Bla
ck (R
-TN
)Th
omas
Cob
urn
(R-O
K)La
rry
Bucs
hon
(R-I
N)
Thom
as H
arki
n (D
-IA)
Jose
ph C
row
ley
(D-N
Y)D
onna
Edw
ards
(D-M
D)
Hea
lth
Gun
Ow
ners
of A
mer
ica
Ever
ytow
n fo
r Gun
Saf
ety
Actio
n Fu
ndN
tl. R
ifle
Assn
. of
Amer
ica
Stev
e St
ockm
an (R
-TX)
Jim Jo
rdan
(R-O
H)
Robi
n Ke
lly (D
-IA)
Dan
ny D
avis
(D-I
L)
Gun
Rig
hts
Iraq
& Af
ghan
ista
n Ve
tera
ns o
f Am
eric
aFl
eet R
eser
ve A
ssn.
Retir
ed E
nlis
ted
Assn
.Un
ited
Spin
al A
ssn.
Amer
ican
Leg
ion
Jeff
Mill
er (R
-FL)
Rich
ard
Burr
(R-N
C)M
ike
Coffm
an (R
-CO
)D
ina
Titu
s (D
-NV)
Rick
Lar
sen
(D-W
A)Be
to O
’Rou
rke
(D-T
X)
Mili
tary
& V
eter
ans’
Affa
irs
Fig
ure
3:
Est
imate
dLSNM
Posi
tion
s:Fu
ll113th
Con
gre
ss.
Th
isfi
gure
pre
sents
the
two-
dim
ensi
onal
late
nts
pac
ep
osit
ion
san
dp
opu
lari
tyfa
ctor
sin
ferr
edfr
omth
e113
thC
on
gres
sdata
set.a
We
ind
icat
ese
ver
alsi
gnifi
cant
clu
ster
sco
rres
pon
din
gto
spec
ific
ind
ust
ries
and
issu
ear
eas.
Th
eco
lor
dim
ensi
on
scal
esfr
om
red
tob
lue
wit
hth
eD
W-N
OM
INA
TE
ideo
logy
dim
ensi
on,
and
the
poi
nt
size
isp
rop
orti
onal
toth
eex
pon
enti
alof
anag
ent’
sp
opu
lari
tyfa
ctor
(exp
(αi)
orex
p(βj))
.W
ean
not
ate
the
clu
ster
sw
ith
som
ere
pre
senta
tive
mem
ber
s;fo
rco
mp
lete
mem
ber
ship
list
sfo
rea
chof
the
hig
hli
ghte
dcl
ust
ers,
con
sult
Ap
pen
dix
A.7
.T
he
regi
onin
dic
ated
wit
ha
das
hed
circ
leis
the
“cen
ter
clu
ster
”w
her
ew
eob
serv
ea
stro
ng
con
centr
atio
nof
poli
tici
an
sb
elon
gin
gto
pro
ced
ura
lco
mm
itte
es,
wh
ich
we
wil
lan
alyze
exte
nsi
vel
yw
ith
ali
nk
com
mu
nit
ym
od
elin
Sec
tion
4.2.
aT
wo
agen
tsw
ith
outl
yin
gla
tent
space
posi
tions
(bel
owth
ere
gio
nsh
own),
Rob
Woodall
(R-G
A)
andW
elchAlly
n,In
c.,
are
om
itte
dfo
rth
esa
ke
of
vis
ual
clari
ty.
10
(H) Fina
ncial
Servi
ces
(S) Ban
king,
Housin
g, & Urba
n Affa
irs
(H) Bud
get
(S) App
ropria
tions
(J) Eco
nomic
(H) Tran
sporta
tion &
Infra
struct
ure
(S) Sp
ecial
Aging
(H) Hom
eland
Secur
ity
(H) Small
Busine
ss
(H) Way
s & Mea
ns0369
1215182124
Num
ber o
f Pol
iticia
ns
Finance & Insurance Clusters
(H) Ene
rgy & Com
merce
(H) Natu
ral Reso
urces
(H) Arm
ed Se
rvices
(S) Agri
cultur
e, Nutr
ition,
& Fores
try
(H) Tran
sporta
tion &
Infra
struct
ure
(H) Ove
rsigh
t & Gov
. Refo
rm
(S) En
viron
ment &
Public
Work
s
(S) En
ergy &
Natural
Resourc
es
(S) Fo
reign
Relatio
ns
(S) Sm
all Busi
ness
& Entre
prene
urship
Energy Cluster
(H) Way
s & Mea
ns
(H) App
ropria
tions
(H) Edu
cation
& the W
orkfor
ce
(H) Tran
sporta
tion &
Infra
struct
ure
(H) Ene
rgy & Com
merce
(H) Scie
nce, S
pace,
& Techn
ology
(H) Bud
get
(H) Fina
ncial
Servi
ces
(H) Judic
iary
(S) App
ropria
tions
Center Cluster
Figure 4: Committee Membership of Politicians Per Cluster. Histograms of the top 10committee memberships for politicians in the larger clusters highlighted in Figure 3. Bars aredivided by political party (red for Republican and blue for Democrat), and the horizontal bars atthe top show the overall party distribution of each cluster. Senate committee and House committeelabels are prefixed with (S) and (H), respectively.
Figure 3 shows that the key structure emerging from this two-dimensional model is industry-
related, domain-specific interests of political actors. In fact, significant clustering appears in the
LSNM representation as indicated by the outlines and lists of example members in the figure. As
shown by the example group members, the clustering among lobbying clients is strongly industry-
based. Importantly, these clusters emerge even though the clients involved do not lobby on the
same bills (recall that most bills are only lobbied on by a few clients). Instead, related clients are
“pulled together” through their connections to the same politicians, who sponsor separate bills on
which the clients separately lobby. As in the one-dimensional model, there is no clear partisan
divide either across or within industry clusters. Rather, we find in each large cluster of clients a
concentration of politicians from relevant committees involved with the policy issues that directly
affect those clients. For instance, Figure 4 shows that the legislators who belong to the “Finance”
and “Insurance” clusters are likely to serve in the House Financial Services Committee, and
those who belong to the “Energy” cluster are likely to serve in the House Committee on Energy
and Commerce.11
To formally investigate the importance of industry and committee affiliations, we conduct a
series of statistical tests. Specifically, we compare the performance of “Restricted” and “Full”
models whereby the former is nested in the latter. Table 1 presents the F-statistics and p-Values
summarizing the significance of obviating a certain set of observable covariates from the “Full”
model. We find that committee memberships are the only significantly informative covariates of
11We also provide the full lists of cluster memberships in Appendix A.7 for reference.
11
Independent Variables Restricted Full Restricted Full Restricted FullChamber X X X X X XGender X X X X X XLeadership Position X X X X X XParty X X X X X XCommittees X X X X XState X X X X XDW-NOMINATE X X X X XDependent Variable F -Statistic (p-Value) F -Statistic (p-Value) F -Statistic (p-Value)1D LSNM 2.86 (1.55e−8) 1.19 (0.18) 1.38 (0.25)2D LSNM, Dimension 1 2.68 (1.22e−7) 1.76 (1.25e−3) 0.97 (0.38)2D LSNM, Dimension 2 3.10 (8.85e−10) 1.19 (0.18) 1.22 (0.30)
Table 1: Regression Analysis of LSNM Covariates. Statistics of linear models of numerouscovariates against the one- and two-dimensional LSNM latent dimensions for politicians are pre-sented. We compare full linear models with all available covariates included against restricted mod-els, omitting either committee-related covariates, geographic information, or ideology measures. Inthe bottom three rows, for each of the three latent dimensions, we give the F statistics and p-values of the comparison F -test. The test illustrates that the committee membership covariatesare significantly explanatory of the LSNM latent space organization, and, with lesser confidence,geographic covariates are also explanatory of one dimension in the two-dimensional LSNM.
politicians’ positions in the one-dimensional model (see the row labeled as 1D LSNM). In the two-
dimensional model, we find some evidence for the importance of geographic location. However, we
consistently find that including committee memberships significantly improves the statistical fit of
the model, whereas ideology plays insignificant roles.12
Not all regions in our spatial representation, however, are characterized by domain-specific
political interests. We find that a region in the center of the latent space (marked by the dotted
circle in Figure 3) is populated by politicians with committee memberships that are not directly
related to the industries of the relevant clients (which vary widely) but rather concern federal
spending and taxation, most prominently the House Committee on Ways and Means and
House Committee on Appropriations, as shown in Figure 4. Several of the highest-degree
lobbying clients, such as Chamber of Commerce (the single highest-degree lobbying client in the
network), lie in this region. The cluster also includes many of the highest-degree politicians (who
are also often among the most senior politicians), such as Thomas Harkin (D-IA), Patty Murray
(D-WA), Carl Levin (D-MI), Harold Rogers (R-KY), and Chuck Grassley (R-IW). We defer further
interpretation of the characteristics they have in common to Section 4.2, where we will introduce
mixed-membership community models in part to answer this question.
The analysis presented in this section demonstrates the importance of political communities
in the lobbying network, organized according to industry interests and jurisdictional committee
memberships. This is in contrast to the dominance of ideological factors that underlie the networks
12Similarly, we find that industry affiliation, measured by interest group’s NAICS industry code, is the primaryfactor that is significantly related to the estimates.
12
of electoral politics mentioned before. Although the identification of clusters and their visual
demarcations can still be seen as arbitrary, the spatial location of estimated “preferred policy”
exhibit substantively meaningful patterns. In fact, the relative geometry of the clusters is consistent
with common intuitions about relationships among industries: we observe adjacencies between
clusters corresponding to “Finance” and “Insurance”, “Technology” and “Telecom”, “Energy” and
“Environmentalism.” Having bolstered our belief that political communities exist in the lobbying
network, we next seek to study them in a more explicit and statistically principled manner, adapting
our modeling techniques to better suit community structure.
4 Community Modeling of the Lobbying Network
In this section, we introduce new models to clarify the interpretation of the political community
structure that we identified in Section 3. We use variants of the stochastic block model (SBM) to
explicitly model the interaction between a politician and a client as a function of their community
memberships, so that politicians and interest groups with shared group memberships interact more
frequently. Unlike the LSNM, the SBM explicitly assumes a data generating process in which
communities exist. Furthermore, being a probabilistic model, it gives a principled way to estimate
the community parameters, unlike our ex post analysis of the LSNM. The findings from these
models confirm that political communities do exist, and that most of them are organized around
industries and associated committees. They also suggest some revisions of the clustering we found
from the spatial representation of the lobbying network given by the LSNM, capturing several new
clusters that were not previously apparent. Finally, as we will see, these models lead us to discover
novel organization in the lobbying network stemming from mixed interests.
4.1 Motivation: The Bipartite Stochastic Block Model
We base our initial analysis on the Bipartite Stochastic Block Model (biSBM) developed by Lar-
remore, Clauset, and Jacobs (2014). As we will see, though this model confirms the existence of
communities in the lobbying network, it will also raise modeling concerns that will require new
techniques to address. We first review the definition of this model in our particular setting below.
The Model Following the formulation of the LSNM in Section 3, we model Ai,j as having a
Poisson distribution, whose mean now depends exclusively on the cluster memberships of client i
and politician j. To formalize this for our case, we introduce a matrix parameter B ∈ Rk×` with
entries Br,s, where k is the number of client clusters and ` the number of politician clusters, viewed
as hyperparameters.13 To fix these, we draw upon our exploratory analysis with the LSNM, which
suggested roughly how many distinct clusters we expect to exist. Using the analysis of Figure 3,
13The original work of Larremore, Clauset, and Jacobs (2014) treats as multiple edges what we treat as non-negativeinteger edge weights, but formally the models are identical.
13
we choose to use eight politician and lobbying clients throughout our analysis. The membership
parameters are denoted by xi ∈ [k] and yj ∈ [`] for each i ∈ [m] and j ∈ [n]. As before, “popularity
factor” variables αi and βj are introduced per client and politician respectively. The entries Ai,j
are then modeled as
Ai,j ∼ Poisson(αiβjBxi,yj ) (4)
independently, so that the joint probability is:
P(A | B,x,y,α,β) =m∏i=1
n∏j=1
Poisson(Ai,j | αiβjBxi,yj ). (5)
In the block modeling literature the inclusion of the variables αi and βj is known as degree cor-
rection, because these variables adjust the model to account for broad degree distribution in a
network. The model described by equation (4) is therefore named the the Degree-Corrected Bipar-
tite Stochastic Block Model (dc-biSBM), while the model with these variables omitted is simply the
Bipartite Stochastic Block Model (biSBM).14
Computation We perform maximum likelihood parameter estimation for both the biSBM and
the dc-biSBM using the Expectation-Maximization (EM) algorithm implementation provided with
the work of Larremore, Clauset, and Jacobs (2014). Since this algorithm is not guaranteed to
converge to the parameters maximizing the likelihood, we take the parameters obtained from 50
randomly initialized runs and choose those that attain the highest likelihood value.
Empirical Findings Previously, we interpreted the proximity of latent positions obtained by the
LSNM as an indicator of common community membership. To validate this interpretation, we begin
by comparing the LSNM results from Section 3 with the community memberships found by the
dc-biSBM. Figure 5 overlays the clusters identified by dc-biSBM on the geometry obtained from the
LSNM. To facilitate a direct comparison, we show the latent positions of each individual political
actor as well as the grey boundaries previously used to group clients and politicians. We then pair
the dc-biSBM communities of clients and politicians (under maximum likelihood assignment) by
spatial proximity, assign different colors and markers to visually distinguish the eight community
pairs, and label them according to natural interpretations of their members.
The figure shows that the LSNM and dc-biSBM community structures align remarkably well
with each other: the boundaries that we drew indeed correspond to distinct political communities
or to mixtures of at most two or three political communities. Indeed, in one case, for the “Energy”
cluster, the two models find essentially identical clusters: 98% of the lobbying clients and 94% of the
14We will work primarily with the dc-biSBM in the following analysis; further discussion of its advantages over thebiSBM for analyzing the lobbying network is given in Appendix A.4.
14
4 3 2 1 0 1 2 3 4Latent Space Dimension 1
4
3
2
1
0
1
2
3
4
5
Late
nt S
pace
Dim
ensi
on 2
EnergyFinance &InsuranceTechnology &TelecomHealthcareLand Use &Gun RightsCivil SocietyMixed Cluster 1Mixed Cluster 2
Figure 5: Latent Space Position vs. Community Membership. We plot paired communitiesof clients and politicians discovered by the dc-biSBM against latent positions from the LSNM. Mostof these communities align well with the industry-level clusters we manually identified in Figure 3.The light gray box highlights a region where two dc-biSBM clusters intersect in the spatial model,which is plotted in greater detail in Figure 6.
politicians in the LSNM energy cluster also belong to a single dc-biSBM cluster (orange triangles).
The “Finance” and “Insurance” clusters, which the LSNM led us to separate, are instead merged
into a single cluster in the dc-biSBM (green squares), with 98% of lobbying clients and 97% of
politicians from both of these LSNM clusters belonging to a single dc-biSBM cluster. Several other
similar relationships may be observed in the figure; simply put, most of the variation captured by
the LSNM can be captured by discrete community assignments.
However, the LSNM and the dc-biSBM do not always depict the same underlying community
structure. Specifically, we find that an alignment of preferences sometimes cannot be summarized in
a single community membership. This can be seen especially clearly in the highlighted rectangular
region in Figure 5, where we observe that some spatially distant political actors belong to the
same dc-biSBM communities, while some belonging to different dc-biSBM communities are spatially
adjacent. To investigate this, we zoom in on the region and display several example clients from
the two most salient communities in Figure 6. First, we find that most of the agents in the
spatial “Environmentalism” and “Gun Rights” clusters, which lie far away from each other in
15
1.0 0.5 0.0 0.5 1.0 1.5 2.0Latent Space Dimension 1
0
1
2
3
4
Late
nt S
pace
Dim
ensio
n 2
Land Use & Gun RightsCivil SocietyMixed Cluster 1Mixed Cluster 2
Land Use & Gun Rights Clients
Ntl. Rifle AssociationNtl. Shooting Sports FoundationEverytown for Gun SafetyNtl. Fisheries InstituteDefenders of WildlifeFarm Animal Welfare CoalitionCropLife AmericaNtl. Milk Producers FederationGrocery Manufacturers Assn.
Civil Society Clients
Mental Health AmericaSenior Citizens LeagueNtl. Women’s Law CenterNAACPAnti-Defamation LeagueACLUAFL-CIOParalyzed Veterans of AmericaNtl. Education AssociationKent State University
Figure 6: New dc-biSBM Clusters. We plot two “issues” and two “mixed” clusters of clientsand politicians discovered by the dc-biSBM against the latent space positions from the LSNM,along with the clusters based on latent positions that we found manually and showed previouslyin Figure 3. We draw the reader’s attention to the fact that these clusters intersect spatially inthe LSNM, a circumstance whose interpretation is discussed in the main text. Some examples ofclients from each of the dc-biSBM issues clusters are given in the tables on the right.
LSNM, are merged into a single dc-biSBM community (grey circles). Instead of an industry-based
grouping, we interpret these as participants in a broad “Land Use” debate, including gun rights
and hunting organizations, agricultural organizations, and environmental conservation groups (The
connection with gun rights appears to arise due to controversy over hunting rights, evidenced by the
membership of organizations like the National Shooting Sports Foundation in this cluster).
Second, the “Military & Veterans’ Affairs” cluster at the top is expanded to include many other
agents below (brown plus signs), who we interpret as participants in a “Civil Society” debate.
Indeed, this cluster includes foundations for the rights of the disabled, senior citizens, women, and
ethnic and racial minorities, as well as veterans’ associations, labor unions, education groups, and
general civil rights advocacy organizations.
This classification notwithstanding, an issue like gun rights touches upon both questions of civil
rights and land use. Thus, an interest group concerned with gun rights might lobby similarly to
16
some interest groups that clearly belong to the “Civil Society” cluster, and similarly to others that
clearly belong to the “Land Use” cluster. Notice that the grey circles and brown plus signs intersect
one another in the middle of Figure 6. We argue that this is an instance of precisely such mixed
interests in the lobbying network: strict classification for some organizations is complicated by the
interplay of interests in multiple arenas of debate. For instance, the National Shooting Sports
Foundation previously mentioned lies near this region.15 Moreover, the two clusters we denoted
“Mixed Cluster 1” and “Mixed Cluster 2” in Figures 5 and 6, so named because there appears to
be no unifying topical description of their members, lie mostly in the central cluster of the LSNM,
the region marked by the dotted circle in Figure 3 which we previously found to contain large
lobbying clients with diverse interests and senior politicians with broad authority. It therefore
appears entirely reasonable that mixed interests are in fact the phenomenon characterizing this
region, an intuition that we will make precise in the following section.
To summarize, with the dc-biSBM we obtain an explicit analysis confirming that there in fact
exist communities in the lobbying network, and that these communities are again not aligned
with one-dimensional ideological polarization, but rather with a rich collection of industry- and
committee-level distinctions. We also find new community structure corresponding not to industries
but to cultural or political debates, and find that for these cases, the non-geometric approach
of community modeling uncovers interesting group memberships that were not clear from the
ideal point models previously used in much of the relevant literature. Finally, examining the
characteristic interests of some of the lobbying clients placed in these clusters, we find that it is
perhaps inappropriate to impose them a “hard” classification into a single community, because
such clients may be best described by a mixture of political interests. In the following section,
therefore, we propose a new community model that represents mixed interests of political actors
in an explicit probabilistic framework.
4.2 The Proposed Bipartite Link Community Model
We propose the Bipartite Link Community Model (biLCM), whose goal is to capture actors’ si-
multaneous membership in several political communities. In the proposed model, political “links”
(edges) between a politician and lobbying client can be manifested by heterogeneous interests. This
methodology allows us to capture the important possibility of politicians’ having multidimensional
preferences involving several issues (Lauderdale and Clark, 2014), as well as interest groups pur-
suing several simultaneous objectives and aggregate interest groups such as industry organizations
accomodating the varied interests of their members.
It is particularly apparent that this is sensible for politicians, who almost always sit on several
15The mean coordinates of this lobbying client’s latent position are (0.33, 2.42).
17
committees having distinct legislative jurisdictions. For lobbying clients, mixed interests of course
still arise, but in several varieties. We have already suggested one example in the previous section,
where a lobbying client like the Sierra Club may maintain interests in, say, an environmental
issue (e.g. lobbying on 113th H.R. 2424, Community Parks Revitalization Act together with the
Trust for Public Land), but may for various reasons also sometimes lobby on broader civil
rights legislation (e.g. lobbying on 113th H.R. 3206, Global Sexual and Reproductive Health Act of
2013 together with Planned Parenthood). Perhaps a simpler instance of the same phenomenon
occurs with larger holding companies, which lobby on behalf of their diverse subsidiaries in multiple
arenas. For instance, the lobbying client Philips Holding USA, Inc., which specializes in medical
equipment, home appliances, and lighting solutions, will split its membership between one link
community focused on healthcare, and another focused on retail and shipping. In fact, Philips
Holding USA, Inc. lies near the center of the LSNM latent space in Figure 3, the region whose
proper interpretation we have suggested in the previous section is linked to mixed interests, and
the Sierra Club lies in the nearby and likewise ambiguous region near the intersection of the two
clusters highlighted in Figure 6.16
Modeling Choices for Mixed Membership In the lobbying network, an even stronger version
of mixed interests arises than the one we have emphasized above: not only do political actors often
have several simultaneous interests, but even one politician and one lobbying client often coincide
on bills that should not be grouped together. This is especially apparent between agents with broad
ranges of interests, like large lobbying clients and senior politicians. For example, the Chamber of
Commerce and Senator Barbara Boxer (D-CA) coincide on both 113th S. 601 “Water Resources
Development Act of 2013” and 113th S. 462 “United States-Israel Strategic Partnership Act of
2013”. Likewise, the Specialty Equipment Market Association (SEMA) and Senator John
Cornyn (R-TX) coincide on both 113th S. 983 “Keep the IRS Off Your Health Care Act of 2013”
and 113th S. 2635 “21st Century Endangered Species Transparency Act”.
Therefore, a natural modeling assumption is that political actors have simultaneous member-
ship in several legislative communities, and they “play the role” of one of those memberships in
each individual interaction that they partake in. The mixed-membership stochastic block model
(mmSBM), introduced by Airoldi et al. (2008), is perhaps the most common alternative to the
type of model we will propose, and involves each actor having an associated probability distribu-
tion over communities, the interaction between a pair of actors depending on a pair of community
assignments that the actors choose for that interaction partner from their respective distributions.
For example, the authors apply the mmSBM to the Sampson monastery dataset (Sampson, 1969),
16The mean coordinates of these lobbying clients’ latent positions are (−0.25, 0.51) and (−0.68, 1.32), respectively.
18
(a) dc-biSBM (b) mmSBM (c) biLCM
Figure 7: Schematic Comparison of Community Models. We illustrate three generativenetwork models, the single-membership dc-biSBM model and the mixed-membership mmSBM andbiLCM models. For the sake of simplicity, the former two models are illustrated in the assortativecase, where each politician community interacts strongly with just one lobbying client communityand vice-versa. (a) In the single-membership model, political actors with a shared communitygroup membership tend to interact more frequently. (b) mmSBM allows for mixed membershipdistributions while all interactions involving a given partner are constrained to be a single type. (c)The proposed biLCM allows for mixed membership as well as mixed interaction (link) types. Thiscaptures the common lobbying interactions wherein interest groups and politicians with diversecommunity memberships interact for different political reasons.
a popular example in the social networks literature which summarizes survey data on social rela-
tionships among 18 novice monks joining a monastery, and identify the group of “waverer” monks,
who do not commit to one of the primary social factions in the monastery, but rather maintain
friendships with members of several factions.
Although this model has some desirable features that allow multiple memberships for political
actors, the mmSBM always retains the property that the mixed membership it models consists
of an agent varying their community membership across their interactions with different agents.
But, as we described before, the mixed membership we are interested in modeling is different—we
are concerned with political actors playing different roles even in their interactions with a single
partner over the course of a session of Congress. In fact, such variation in connections is ubiquitous
in political networks, as the motivating examples above demonstrate. We illustrate this important
conceptual difference graphically with Panels (b) and (c) of Figure 7.17 These show that a model
in the mmSBM family would bizarrely insist that a single common community membership must
account for each pair of incidences involving a given politician and interest group, while in reality,
and as modeled by the biLCM to be introduced, the bills that link a given politician and interest
17We consider an extended version of the original mmSBM to assortative and weighted relationships that are closerto the data we observe in the lobbying network. Mathematically, the difference we are emphasizing is that an mmSBMadjusted to have Poisson-distributed weights on interactions would model each edge weight as a mixture of Poissondistributions, while the link community model we propose will model each edge weight as a sum of independentPoisson distributions, one for each link community in which the agents interact.
19
group often have different underlying motivations and subject matter, and thus should be modeled
as such. (Of course, as Panel (a) of Figure 7 shows, both models are far more flexible than the
dc-biSBM, which makes the assumption that only a single fixed community membership is allowed
for each actor.)
The Model Our model combines three features which, to the best of our knowledge, have not
been previously combined in the mixed membership community modeling literature. First, it ex-
plicitly models link communities; second, it does so in the bipartite setting; and finally, it uses
a statistically principled generative model. Our model structure is most similar to that of Ball,
Karrer, and Newman (2011), but borrows the ideas for specialization to bipartite graphs from Lar-
remore, Clauset, and Jacobs (2014). The most similar work we are aware of is the link community
detection task in bipartite graphs treated as an optimization problem on a graph and solved by an
ad hoc genetic algorithm by Li, Zhang, and Zhang (2015); unlike that work, our model provides
an underlying probabilistic model and therefore a natural statistical interpretation.
We suppose that there are k link communities, which we will always take as indexed by a
variable z ∈ [k]. To emphasize the political interpretation, we will subsequently refer to these as
legislation communities as well. Each client and politician has a vector of parameters αi,z and
βj,z, respectively, which represent their involvement in legislation belonging to community z. The
number of bills lobbied by client i, sponsored by politician j, and belonging to legislation community
z is modeled as Poisson with a mean proportional to αi,zβj,z. To resolve the identification issue,
we assume that for each fixed z,∑m
i=1 αi,z =∑n
j=1 βj,z = 1, and introduce another parameter κz
to capture the overall weight of group z, so that the number of bills between client i and politician
j in legislation community z has mean κzαi,zβj,z. We assume that these Poisson variables are
independent, so that the model for the incidence matrix entries is
Ai,j ∼ Poisson
∑z∈[k]
κzαi,zβj,z
, (6)
and the joint distribution is
P(A | α,β,κ) =
m∏i=1
n∏j=1
Poisson
(Ai,j
∣∣∣∣ k∑z=1
κzαi,zβj,z
). (7)
Computation We derive an Expectation-Maximization (EM) algorithm for this model in Ap-
pendix A.5, which involes alternating expectation and maximization update steps until the log-
likelihood of the model converges. The update equations produced by our derivation are given
below, including ancillary optimization parameters qi,j(1), . . . , qi,j(k) (the first equation is the ex-
20
pectation step for the ancillary parameters, and the last three equations are maximization steps
for the model parameters).
qi,j(z) =κzαi,zβj,z∑kz=1 κzαi,zβj,z
, (8)
κz =
∑mi=1
∑nj=1Ai,jqi,j(z)∑m
i=1
∑nj=1 αi,zβj,z
, (9)
αi,z =
∑nj=1Ai,jqi,j(z)∑m
i=1
∑nj=1Ai,jqi,j(z)
, (10)
βj,z =
∑mi=1Ai,jqi,j(z)∑m
i=1
∑nj=1Ai,jqi,j(z)
. (11)
As for the dc-biSBM, this procedure is not guaranteed to converge to the parameters maximizing
the likelihood, so we take the parameters obtained from 50 randomly initialized runs and choose
those that attain the highest likelihood value.
Empirical Findings The majority of our analysis of the biLCM will focus on how “spread out”
agents’ distributions over (link) communities are, or how “mixed” the legislative activity of an
agent is between different legislation communities. We first describe a simple way to quantify this
notion. From the model definition, the mean total number of times client i lobbies in community
z equals κzαi,z, and likewise the mean total number of times politician j sponsors in community
z equals κzβj,z. A natural choice of measurement is then to normalize these quantities to form
probability distributions pi,z =κzαi,z∑k
z=1 κzαi,zand qi,z =
κzβj,z∑kz=1 κzβj,z
, and then consider the (Shannon)
entropies Hi = H(pi,1, . . . , pi,k) and Hj = H(qj,1, . . . , qj,k), where H(c1, . . . , ck) = −∑ cz log2 cz.18
In computing entropies, we take logarithms base 2 for the sake of interpretability: an entropy of
H may then be interpreted as, roughly speaking, an agent typically participating in 2H link com-
munities. Simply put, the higher an agent’s entropy, the more “spread out” their link community
distribution is.
With these tools, the first phenomenon the biLCM model can shed light on is the properties
of the region near the center of the LSNM latent space, where we did not find legible community
structure previously. The left panel of Figure 8 repeats the latent space plot from Figure 3, but
now dividing the latent space into small hexagonal parcels and coloring these by the mean link
community membership entropy of agents lying within them. We clearly observe actors of more
mixed membership (darker hexagons) clustered near the center of the latent space. As a more
intuitive visualization, we provide a schematic visual representation of the same phenomenon in
18We reuse the letter H for these quantities since it will always be clear from context whether we are discussing theentropy of specifically clients or specifically politicians. The astute reader may also object at this point that in facta more principled quantity to consider would be the mean of the entropy of empirical link distributions drawn fromour model with a given set of parameters. We concur, but find in practice that this mean agrees almost perfectly(Pearson’s ρ = 0.99) with the quantity we compute.
21
4 2 0 2 4Latent Space Dimension 1
4
3
2
1
0
1
2
3
4
5
Late
nt S
pace
Dim
ensio
n 2
Link Community Distribution Zonal Entropies
4 2 0 2 4Latent Space Dimension 1
Link Community Distribution Examples
Figure 8: Latent Space Position vs. Link Community Distribution. The left panel dividesthe latent space inferred for the full 113th Congress dataset into hexagonal regions (keeping onlythose containing some clients or politicians). We color each region by the average entropy oflink community memberships of the agents in that region, such that a darker color implies agents’participation in multiple communities. The right panel plots example link community distributionsat their corresponding latent positions. We observe that the distributions become more skewedtowards single communities as we move to the edges of the latent space, whereas actors with themost “spread out” link community distribution tend to lie in the center.
the right panel, where we choose just a few example lobbying clients and represent each with a
pie chart showing their estimated link community membership distribution. We consistently find
many of the same industry-specific communities at the edges of the latent space as we have seen
before, such as the energy, technology, and finance industries, whose members typically participate
in only one or two link communities (marked by pie charts with only one or two dominant colors).
And, we again find that political actors in the center region tend to have mixed membership among
many different political communities (marked by pie charts split evenly among many colors).
We have already seen in Figure 4 that the politicians in the central LSNM cluster tend to belong
to “procedural” committees, having broad and less domain-specific responsibilities and involving
general categories of government oversight or budgetary allocation. The lobbying clients in this
group also admit a simple and politically meaningful description: many of them are large associ-
ations collectively representing many small businesses, employee groups, or broad causes, such as
the Chamber of Commerce, the Specialty Equipment Market Association, Heritage
Action for America, the National Treasury Employees Union, and numerous others.
Based on these characterizations and the observation that such political actors also exhibit very
mixed community membership, we propose that these actors participate in lobbying through ag-
gregate action in the lobbying marketplace, either in the form of direct lobbying of politicians with
22
broad responsibilities (as indicated by their membership in procedural committees), or in the form
of lobbying through large associations by many individuals or smaller firms as often observed in
trade politics. We believe that these findings indicate a unique aggregate mode of lobbying, wherein
politicians with control over broad ranges of policy are lobbied by many otherwise unrelated inter-
est groups, and conversely lobbying clients representing broad ranges of smaller constituents lobby
many otherwise unrelated politicians. We compare some examples of the lobbying clients operating
in this manner to others whose lobbying is more focused in Table 2.
How do we know whether a given political actor’s link community distribution is unusually
concentrated or mixed? To provide an answer, we perform a permutation test on the lobbying
network to obtain a null distribution of “mixedness” (measured by entropy) of the biLCM results
for networks with similar connectivity to the lobbying network (see Appendix A.6 for technical
details). Since the biLCM can capture both concentrated and mixed community membership, this
is also a useful test of the statistical significance of the communities we have been discussing: if the
lobbying network has strong community structure compared to a “generic” similar network, then
the biLCM should find much more concentrated community memberships in the lobbying network
than in such a generic network. Indeed, we find that this is the case, observing that the link
community distributions in the null model are typically much more mixed than those in the model
of the lobbying network.
We also observe that, in addition to elucidating aggregate lobbying, mixed membership gives
a clean way to explain many of the questions raised by our single membership clusterings previ-
ously. A simple example is Philips Holding USA, Inc., which as we noted before manufactures
both healthcare products and electronics. Accordingly, it has a probability of participation in the
“Healthcare” cluster of 0.31, and in the “Retail / Shipping” cluster of 0.26 (and these are the
two clusters it is most likely to participate in). Similarly, CropLife America has a probability
of participation in the “Retail / Shipping” cluster of 0.74, and in the “Energy” cluster of 0.23.
The biLCM also finds a very cleanly delineated “Civil Society” cluster, which clarifies some of the
unusual findings around the corresponding cluster of the dc-biSBM. For instance, as we inferred
previously, several gun rights groups (the National Rifle Association, the National As-
sociation for Gun Rights, and Gun Owners of America, Inc.) indeed have the highest
probability of participating in the “Land Use” cluster, but also a small additional probability of
participating in the “Civil Society” cluster, confirming our interpretation that the motivation of
these groups to lobby is divided between an issue-driven interest in gun rights and an ideological
commitment to general conservatism on cultural questions. The same occurs for the Sierra Club,
which typically takes the opposite side of the gun rights debate from these organizations.
23
Lobbying Client Distribution Example Bills
Berkshire Hathaway EnergyEnergy Freedom and Economic Prosperity ActGeothermal Production Expansion ActHydropower Regulatory Efficiency Act
Microsoft CorporationStartup Act 3.0Cybersecurity Enhancement ActSurveillance Transparency Act
Sierra ClubTennessee Wilderness ActKeystone XL Pipeline Approval ActGlobal Sexual and Reproductive Health Act
Philips Holding USA
Integrated Electronic Health Records forMilitary and Veterans Act
Energy Savings and IndustrialCompetitiveness Act
Domestic Prosperity and Global Freedom Act
Chamber of CommerceSTEM Visa ActUnited States-Israel Strategic Partnership ActSensible Accounting to Value Energy Act
Heritage Action for AmericaMarriage and Religious Freedom ActSave American Workers ActFree Flow of Information Act
Table 2: Link Community Distribution Examples. We give examples of lobbying clients withfocused, intermediate, and very spread out link community distributions, along with examples ofbills illustrating their interests. We observe some consistencies among the communities emphasizedfor different clients; for instance, Berkshire Hathaway Energy and the Sierra Club share aninvolvement in an energy-focused community, while Philips Holding USA and the Chamberof Commerce share an involvement in a retail- and shipping-focused community.
On the whole, we find that the biLCM successfully recovers a readily interpretable mixed mem-
bership profile, where a lobbying client and a politician must balance the pursuit of several distinct
political commitments. That is, the proposed methods not only capture the community structure
that we often do not observe, but also provide a useful guidance for gauging the mixed-interests
and complex preference aggregation that drive lobbying in legislative politics.
5 Concluding Remarks
Lobbying is known to be an important channel through which special interest groups affect the
U.S. legislative process. Even so, explicit observable connections between interest groups and
the politicians who represent those interests have proved elusive, since the individual politicians
contacted in the course of lobbying need not be disclosed. We assemble a new lobbying network
dataset by measuring legislative interactions between interest groups and politicians across all bills
introduced since the 106th Congress, after carefully identifying the lobbying instances associated
with each bill.
24
We show that this network can be usefully modeled by both latent space models and community
models. Unlike previous applications of latent space models in political science under the rubric
of ideal point estimation, our models suggest that lobbying interactions mostly depend not on
an underlying ideological spectrum, but rather on an underlying grouping into domain-specific
communities. Stochastic block models confirm and clarify this finding by explicitly modeling the
community structure we empirically observe. Furthermore, our bipartite link community model
captures a richer network structure in which some politicians have a tight connection with a single
industry specifically related to their committee membership, while others who serve in “procedural”
committees with broader subject matter are linked to multiple political communities and aggregate
interest groups that represent highly heterogeneous political interests. The consistency between
latent space models and non-geometric community models in describing these patterns provides
strong evidence for political divisions that are not aligned with ideological polarization as the
dominant structural feature of the lobbying network in U.S. legislative politics.
Our findings open up new possibilities for studying the mechanics of lobbying, showing that
it is helpful to view lobbying activity as occurring in a network of political actors, and therefore
suggesting that other network analysis techniques may reveal further meaningful patterns underly-
ing lobbying. Of course, the lobbying network also contains much more information than we have
analyzed here, including bills and their contents, lobbyists and their lobbying firm affiliations, dis-
closures of money spent on lobbying, and the timing of lobbying and legislative events. Extending
the network models in this work to describe new types of agents and interactions should further
enhance the picture of lobbying and its political effects that we have obtained here.
25
References
Airoldi, Edoardo M., David M. Blei, Stephen E. Feinberg, and Eric P. Xing. 2008. “Mixed Mem-bership Stochastic Blockmodels.” Journal of Machine Learning Research 9.
Ansolabehere, Stephen, James M Snyder, and Micky Tripathi. 2002. “Are PAC Contributions andLobbying Linked? New Evidence from the 1995 Lobby Disclosure Act.” Business and Politics4 (2): 131–155.
Austen-Smith, David. 1995. “Campaign Contributions and Access.” American Political ScienceReview 89 (3): 566–581.
Austen-Smith, David, and John R Wright. 1992. “Competitive Lobbying for a Legislator’s Vote.”Social Choice and Welfare 9 (3): 229–257.
Bafumi, Joseph, Andrew Gelman, David K Park, and Noah Kaplan. 2005. “Practical Issues inImplementing and Understanding Bayesian Ideal Point Estimation.” Political Analysis 13: 171–187.
Bailey, Michael A, Anton Strezhnev, and Erik Voeten. 2017. “Estimating dynamic state preferencesfrom United Nations voting data.” Journal of Conflict Resolution 61 (2): 430–456.
Ball, Brian, Brian Karrer, and M.E.J. Newman. 2011. “An efficient and principled method fordetecting communities in networks.” Physical Review E 84.
Barbera, Pablo. 2014. “Birds of the Same Feather Tweet Together: Bayesian Ideal Point EstimationUsing Twitter Data.” Political Analysis 23 (1).
Bauer, Raymond, Lewis Anthony Dexter, and Ithiel de Sola Poll. 1972. American Business andPublic Policy: The Politics of Foreign Trade. New York: Aldine.
Baumgartner, Frank R, Jeffrey M Berry, Marie Hojnacki, Beth L Leech, and David C Kimball.2009. Lobbying and Policy Change: Who Wins, Who Loses, and Why. University of ChicagoPress.
Bertrand, Marianne, Matilde Bombardini, and Francesco Trebbi. 2014. “Is It Whom You Know orWhat You Know? An Empirical assessment of the Lobbying Process.” The American EconomicReview 104 (12): 3885–3920.
Bombardini, Matilde, and Francesco Trebbi. 2012. “Competition and Political Organization: To-gether or Alone in Lobbying for Trade Policy?” Journal of International Economics 87 (1):18–26.
Bond, Robert, and Solomon Messing. 2015. “Quantifying Social Medias Political Space: EstimatingIdeology from Publicly Revealed Preferences on Facebook.” American Political Science Review109 (1): 62–78.
Bonica, Adam. 2013. “Mapping the Ideological Marketplace.” American Journal of Political Science58 (2): 367–386.
Carpenter, Bob, Andrew Gelman, Matthew D. Hoffman, Daniel Lee, Ben Goodrich, Michael Betan-court, Marcus Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell. 2017. “Stan: A ProbabilisticProgramming Language.” Journal of Statistical Software 76 (1).
Clark, Tom S, and Benjamin Lauderdale. 2010. “Locating Supreme Court Opinions in DoctrineSpace.” American Journal of Political Science 54 (4): 871–890.
26
Clinton, Joshua, Simon Jackman, and Douglas Rivers. 2004. “The Statistical Analysis of Roll CallData.” American Political Science Review 98 (2): 355–370.
De Figueiredo, John M, and Brian Kelleher Richter. 2014. “Advancing the Empirical Research onLobbying.” Annual Review of Political Science 17: 163–185.
Faccio, Mara. 2006. “Politically Connected Firms.” The American economic review 96 (1): 369–386.
Faccio, Mara, Ronald W Masulis, and John McConnell. 2006. “Political Connections and CorporateBailouts.” The Journal of Finance 61 (6): 2597–2635.
Fortunato, Santo. 2010. “Community detection in graphs.” Physics reports 486 (3-5): 75–174.
Fowler, James H. 2006. “Connecting the Congress: A study of Cosponsorship Networks.” PoliticalAnalysis 14 (4): 456–487.
Grossman, Gene M, and Elhanan Helpman. 2001. Special Interest Politics. Cambridge, MA: MITpress.
Hafner-Burton, Emilie M, Miles Kahler, and Alexander H Montgomery. 2009. “Network Analysisfor International Relations.” International Organization 63 (3): 559–592.
Hall, Richard L, and Alan V Deardorff. 2006. “Lobbying as legislative subsidy.” American PoliticalScience Review 100 (1): 69–84.
Hertel-Fernandez, Alexander. 2014. “Who passes businesss model bills? Policy capacity and cor-porate influence in US state politics.” Perspectives on Politics 12 (3): 582–602.
Hoff, Peter D, Adrian E Raftery, and Mark S Handcock. 2002. “Latent Space Approaches to SocialNetwork Analysis.” Journal of the american Statistical association 97 (460): 1090–1098.
Kang, Karam. 2015. “Policy Influence and Private Returns from Lobbying in the Energy Sector.”The Review of Economic Studies 83 (1): 269–305.
Kang, Karam, and Hye Young You. 2017. “The Value of Connections in Lobbying.” Working pa-per available at https://hyeyoungyou.files.wordpress.com/2015/08/value_of_connections1.pdf.
Keck, Margaret E, and Kathryn Sikkink. 1998. Activists Beyond Borders: Advocacy Networks inInternational Politics. Cornell University Press.
Khwaja, Asim Ijaz, and Atif Mian. 2005. “Do Lenders Favor Politically Connected Firms? RentProvision in an Emerging Financial Market.” The Quarterly Journal of Economics 120 (4):1371–1411.
Kim, In Song. 2017. “Political Cleavages within Industry: Firm-level Lobbying for Trade Liberal-ization.” American Political Science Review 111 (1): 1–20.
Kluger, Yuval, Ronen Basri, Joseph T. Chang, and Mark Gerstein. 2003. “Spectral Biclustering ofMicroarray Data: Coclustering Genes and Conditions.” Genome Research 13 (4): 703–716.
Krivitsky, Pavel N, Mark S Handcock, Adrian E Raftery, and Peter D Hoff. 2009. “Representingdegree distributions, clustering, and homophily in social networks with latent cluster randomeffects models.” Social Networks 31: 204–213.
Larremore, Daniel B., Aaron Clauset, and Abigail Z. Jacobs. 2014. “Efficiently Inferring Commu-nity Structure in Bipartite Networks.” Physical Review E 90.
27
Lauderdale, Benjamin E, and Tom S Clark. 2014. “Scaling Politically Meaningful DimensionsUsing Texts and Votes.” American Journal of Political Science 58 (3): 754–771.
Li, Zhenping, Shihua Zhang, and Xiangsun Zhang. 2015. “Mathematical Model and Algorithm forLink Community Detection in Bipartite Networks.” 05 (01): 421-434.
Maoz, Zeev, Ranan D Kuperman, Lesley Terris, and Ilan Talmud. 2006. “Structural Equivalenceand International Conflict: A Social Networks Analysis.” Journal of Conflict Resolution 50 (5):664–689.
Mayhew, David R. 1974. Congress: The Electoral Connection. Yale University Press.
Nourse, Victoria, and Jane S Schacter. 2002. “The Politics of Legislative Drafting: A CongressionalCase Study.”.
Poole, Keith T, and Howard L Rosenthal. 2011. Ideology and Congress. Vol. 1 Transaction Pub-lishers.
Potters, Jan, and Frans Van Winden. 1992. “Lobbying and Asymmetric Information.” Public choice74 (3): 269–292.
Richter, Brian Kelleher, Krislert Samphantharak, and Jeffrey F Timmons. 2009. “Lobbying andTaxes.” American Journal of Political Science 53 (4): 893–909.
Sampson, Samuel F. 1969. “A novitiate in a period of change: An experimental and case study ofsocial relationships.”.
Schickler, Eric. 2001. Disjointed pluralism: Institutional innovation and the development of the USCongress. Princeton University Press.
Shepsle, Kenneth A., and Barry R. Weingast. 1987. “The Institutional Foundations of CommitteePower.” American Political Science Review 81 (1): 85104.
Shor, Boris, and Nolan McCarty. 2011. “The Ideological Mapping of American Legislatures.”American Political Science Review 105 (3): 530–551.
Sinclair, Barbara. 2016. Unorthodox lawmaking: New legislative processes in the US Congress. CQPress.
Slapin, Jonathan B, and Sven-Oliver Proksch. 2008. “A Scaling Model for Estimating Time-seriesParty Positions from Texts.” American Journal of Political Science 52 (3): 705–722.
Snijders, Tom AB, and Krzysztof Nowicki. 1997. “Estimation and Prediction for Stochastic Block-models for Graphs with Latent Block Structure.” Journal of classification 14 (1): 75–100.
Vidal, Jordi Blanes I, Mirko Draca, and Christian Fons-Rosen. 2012. “Revolving Door Lobbyists.”The American Economic Review 102 (7): 3731–3748.
Ward, Michael D, Katherine Stovel, and Audrey Sacks. 2011. “Network Analysis and PoliticalScience.” Annual Review of Political Science 14: 245–264.
Wright, John R. 1990. “Contributions, Lobbying, and Committee Voting in the US House ofRepresentatives.” American Political Science Review 84 (02): 417–438.
Wright, John R. 1996. Interest Groups and Congress. Boston: Allyn and Bacon.
You, Hye Young. 2017. “Ex Post Lobbying.” The Journal of Politics 79 (4): 1162-1176.
28
A Supplementary Appendix
A.1 Identifying Bills and Missing Congress Numbers
Identifying congressional bills in lobbying reports is difficult because bill numbers are repeated
across Congresses, and often do not appear directly annotated with Congress numbers in lobbing
reports. Using the report filing year to guess the Congress often leads to erroneous matches,
because reports filed at the beginning of a new Congress tend to include disclosures of lobbying
activities from the previous year (and therefore, if a new Congress has begun recently, from the
previous Congress as well). For example, consider the following lobbying report filed by Google,
Inc. in 2013. It reads:
Monitor legislation regarding online privacy including Safe Data Act (H.R. 2577, S. 1207) and Do not trackproposals (H.R. 654). Monitor any Congressional or Administration efforts to impose privacy laws on searchengines. Monitor Spectrum acts (S. 911, H.R. 2482).
Figure A.1: First Quarter Report by Google, Inc. in 2013
A naive guess would be that the House bill H.R.2577 refers to a bill from the 113th Congress,
because the report was filed in 2013. However, it is clear from the report that this is a 112th
Congress bill, the “SAFE Data Act”. We use the following strategies to mitigate this problem and
correctly identify Congress session numbers under various circumstances.
1. Bill Number Search: We first identify bill numbers (e.g., H.R.2577 above) using regular
expression search in the report text. In the above example in Figure A.1, our algorithm
would identify bill numbers H.R. 2577, S.1207, H.R.654, S.911, and H.R.2482. Note
that all of these bills are from the 112th Congress rather than the 113th.
2. Congress Identification: Given a bill number found in a specific issue text (a section of
the lobbying report), we attempt to identify the most likely Congress to which that bill would
belong using other text around the bill number. Starting from the Congress containing the
year of the lobbying report, we consider a range of candidate Congresses extending backwards
from that of the lobbying report (by default, we consider three Congresses back; therefore
in the above example, we would consider the 113th, 112th, and 111th Congresses). We then
obtain the bills of the same number from each of these Congresses (omitting the Congresses
that do not have a bill of the given number), and compute a bag-of-words representation
(after a tokenization and stopword filtering pipeline) of each of those bills, giving some vectors
v1, . . . ,vn, and the same representation of the text around the mention of the bill number,
w. The Congress that we choose then corresponds to the maximizer of the cosine similarity,
29
its index given by
i∗ = argmax1≤i≤n
v>i w
‖vi‖‖w‖. (12)
If no bill having the same number exists in the entire range of Congresses we consider, we
simply guess that the bill comes from the Congress of the year the lobbying report was filed.
3. Congress Propagation: If we successfully find a match for a Congress, it may be propa-
gated to the other bills mentioned in the lobbying report, since, being scheduled on a quarterly
basis, lobbying report will effectively always only pertain to legislation in a single Congress.
If different bills in a lobbying report disagree on the best-match Congress, a majority vote
may be taken, but this rarely occurs in practice.
4. Bill Title Search: Bills are sometimes only referred to by titles or alternate names. To
account for this, we clean and tokenize the specific issue sections of the lobbying report, and
perform a text matching operation against a table of bill titles. For instance, this operation
would identify “Safe Data Act” in our previous example, even if the bill number H.R.2577
were not mentioned.
5. Bill Range Expansion: It is also common for bills with nearby numbers to be related, and
for lobbying reports to refer to ranges of bills when they are all being lobbied at once.
“H.R.3009, Trade Act of 2002. Certain miscellaneous tariff bills to suspend the rates ofduty on certain toy-related articles (H.R.4182-4186; S.2099-2103). WTO market accessnegotiations for non-agricultural products Port and border security measures”
Figure A.2: Midyear Report by Mattel, Inc. in 2002
Therefore, if we find two bill numbers that are close (by default if they share the same prefix
and their numbers differ by at most 10), then we consider all other bills with numbers in be-
tween as also being lobbied. For instance, the pattern H.R.4182-4186 in the excerpt shown
in Figure A.2 would be expanded into bills H.R.4182, H.R.4183, H.R.4184, H.R.4185,
and H.R.4186 all being considered lobbied.
A.2 Assembling Datasets
A.2.1 Filtering
Besides reducing the size of our data, the key task in filtering is to increase the minimum degree
of the agents we include. This is especially important for latent space models, which will be the
class of models we consider in the greatest detail; it is intuitive that there is no principled way to
position an actor in the lobbying marketplace if the actor does not lobby or sponsor very much,
30
or at all. Thus, while it may appear appealing at first to filter the dataset independently on the
client and politician side by separate summary statistics like total numbers of bills sponsored or
lobbying reports submitted during a Congress, such filtering is not necessarily aligned with the
goal of finding a large submatrix of A (or induced subgraph of G) with only agents of sufficiently
high degree. In particular, we want to avoid sponsor filtering causing some clients who survived
client filtering to have their degree reduced again, or vice-versa.
We find that the simplest way to avoid such issues is to build the full matrix A without filtering
first, and then filter based only on incidences in a way that explicitly ensures that only high-degree
agents remain. To that end, we use a filtering procedure defined by two thresholds, denoted TL and
TP throughout, which alternates removing all clients with degree lower than TL and removing all
politicians degree lower than TP , until no more clients or politicians are removed in a full iteration.
This is a simple greedy algorithm for finding an induced subgraph of G with only high-degree
nodes, which we find to suffice for our purposes; almost always just a few iterations are required
for the algorithm to terminate, but rarely just one iteration.
A.2.2 Specifications
Each of our datasets pertains to a single Congress, and will be identified by the number of the
Congress, along with one of the following specifications.
• Unthresholded: All politicians in both chambers of Congress any of whose sponsored bills
have been lobbied on are included, and all clients who have lobbied on any bills are included.
• Full: The unthresholded dataset, filtered with the algorithm from XXX with thresholds
TL = 100 and TP = 10. (The name reflects that this dataset contains both chambers of
Congress unlike the per-chamber datasets below; the unthresholded dataset will only be
considered briefly at the beginning of our analysis and should not be confused with this
dataset, which will play a more important role.)
• Senate or House: The full dataset, restricted to include only politicians in the Senate or
House of Representatives, respectively, and then filtered again with TL = 20 and TP = 10
(this last filtering only has a very small effect on the dataset size, and serves only to exclude
a few very low-weight rows and columns from the restricted incidence matrix).
• Top Term: The same construction as the full dataset, but including only those bills whose
top term is the specified term (thereby restricting bills to a certain issue area). We tune the
thresholds used for these datasets to give dataset sizes roughly similar to those of the other
datasets. For the “Energy” top term, we take TL = 5, TP = 5.
31
Congress Specification Clients Politicians Sparsity
113 Unthresholded 6,747 542 98.28%Full 707 525 91.99%Senate 701 111 95.15%House 707 414 93.79%Top Term “Energy” 518 101 95.03%
111 Senate 945 118 86.27%House 949 410 94.26%
Table A.1: Datasets. The specification, shape, and sparsity (fraction of zero entries) of all of thedatasets used in our analysis.
0 50 100 150 200+Bills Lobbied
0.00
0.02
0.04
0.06
0.08
0.10
0.12
Prop
ortio
n of
Lob
byin
g Cl
ient
s
Bills Lobbied per Client
0 20 40 60 80 100Bills Sponsored
Prop
ortio
n of
Pol
iticia
ns
Bill Sponsored per Politician
Lobbying Client Bills LobbiedIraq/Afghanistan Veterans of America 725Chamber of Commerce 684National Cable and Telecom Assn. 312Berkshire Hathaway Energy 307Xcel Energy 292SEMA 290American Civil Liberties Union 286Duke Energy 283Integrys Energy Group 273Edison Electric Institute 253
Politician Bills SponsoredRobert Menendez 107Alan Grayson 96Mark Begich 92David Vitter 85Harry Reid 80Amy Klobuchar 78Sherrod Brown 69Bernard Sanders 69Dianne Feinstein 69Charles Schumer 67
Figure A.3: Distribution of Political Actions: We present the skewed distributions of politiciansponsorship and lobbying counts for the 113th Congress dataset. The activity counts and namesof the most active politicians and clients are presented in the tables.
Lastly, we refer in shorthand to the full 113th Congress dataset as the “main dataset”, since for
models other than latent space models we will present findings only for this dataset, our interest
in those models being primarily methodological comparison. More generally, most of our analyses
will focus on the 113th Congress, with the 111th Congress referenced for comparison. Table A.1
shows basic statistics for all datasets we will use.
32
A.3 Latent Space Model Details
A.3.1 Dimensionality Selection
We present some heuristics on choosing a dimensionality for the LSNM based on a simpler modeling
technique, the spectral biclustering of Kluger et al. (2003). In this method, one computes the
singular value decomposition (SVD) of the bipartite adjacency matrix A, and searches for piecewise
constant structure among the leading singular vectors. The idea we will borrow from this method
is simply that the leading singular vectors contain most of the relevant information for the analysis
of A, proportionally to the magnitude of their singular values, essentially the same concept as in
PCA analyses. We then make a plot of the singular values, analogous to a scree plot, and search
for an “elbow” indicating the suitable number of singular vectors to use. That number gives an
estimate of the dimensionality of the low-rank structure in A, and thus also a heuristic estimate
of the dimensionality to take in the LSNM. In A.4, we show that a dimensionality of two is a
reasonable choice if this heuristic is admitted.
0 100 200 300 400 500Index
0
1
2
3
4
5
6
Squa
red
Sing
ular
Val
ue
1e5
Figure A.4: Bipartite Adjacency Matrix Singular Value Decay. We plot the singular valuesof the matrix A for the 113th Congress dataset, highlighting that the singular values decay rapidlyand the first few appear to capture much of the total weight of this matrix. The first two singularvalues are highlighted, corresponding to our analysis primarily of a two-dimensional latent spacemodel in the main text.
A.3.2 Computation
The Stan code in Listing A.1 was used to fit our latent space models, run through the PyStan
interface. In practice, by far the most important optimization for this model is the vectorization
of the declaration of the Poisson distribution for edge weights; factorizing the latent space posi-
33
tion prior covariance matrix into Cholesky factors and using them to transform standard normal
variables is a common optimization for multivariate normal priors, but our covariance matrix is so
small that we find this not to affect (or even to degrade, in some cases) sampling performance.
For each latent space model we run four MCMC chains, each drawing 10,000 samples, the first
4,000 of which are discarded as warmup (“burn-in”) samples, leaving us with 6,000 usable samples
per chain, for a total of 24,000 samples, which are used to compute estimates of posterior means
of all of our models parameters as used in all visualizations in the main text.
A.3.3 MCMC Diagnostics
Position Variances We employ a useful visualization that captures the variance structure of
all of our latent space position estimates at once. For any agent, if we take N samples of its
d-dimensional position, we may view these samples as the columns of a matrix X ∈ Rd×N . We
let µ ∈ Rd be the vector of means of the rows of X, and let X0 be the centered matrix obtained
by subtracting µi from each entry of the ith row of X. Then, W = 1N−1X0X
>0 ∈ Rd×d is the
empirical covariance matrix of the position samples, which we diagonalize to obtain orthonormal
eigenvectors v1, . . . ,vd and associated non-negative eigenvalues λ1, . . . , λd. This is essentially a
principal component analysis (PCA), except unlike the usual high-dimensional setting for PCA,
we actually have many points in a low-dimensional space, N � d. Nonetheless, the computed
quantities have the same interpretations: λi is the amount of variance of the dataset of sampled
positions in the direction of principal component vi, and the shape of the set of sampled positions
is approximated by the ellipsoid with axes in the directions of the vi and having lengths√λi (one
“directional standard deviation” in each principal direction).
For small d, in particular for d = 2 as we use for most of our models, these ellipses can be drawn
for all embedded points at once, and their scale is easy to understand visually relative to the scale
of the entire latent space. Small ellipses correspond to concentrated posterior distributions for
position parameters, and concentrated posteriors justify the use of the mean position point in our
other visualizations (also, though we did not use such algorithms, concentrated posteriors would
justify safe use of inferential approximations such as EM algorithms and variational methods).
This visualization is given for the model inferred from the main dataset in Figure A.6.
34
64
20
24
Late
nt S
pace
Dim
ensi
on 1
1050510
Latent Space Dimension 2
Polit
icia
ns (D
emoc
rat)
Polit
icia
ns (R
epub
lican
)Lo
bbyi
ng C
lient
s
Cona
gra
Food
sNt
l. Ch
icke
n Co
unci
lNt
l. Tu
rkey
Fed
erat
ion
Amer
ican
Mea
t Ins
titut
eUt
d. F
resh
Pro
duce
Ass
n.Cr
acke
r Bar
rel O
ldCo
untr
y St
ore
Intl.
Dai
ry F
oods
Ass
n.
Stev
e W
omac
k (R
-AR)
Roge
r Wic
ker (
R-M
S)F.
Sen
senb
renn
er
(R-W
I)Bo
b Go
odla
tte
(R-V
A)
Food
Ford
Mot
or C
ompa
nyAl
lianc
e of
Aut
omob
ileM
anuf
actu
rers
Hyun
dai /
Kia
N/A
Auto
mot
ive
Ntl.
Corn
Gro
wer
s As
sn.
Amer
ican
Coa
litio
n fo
rEt
hano
lPo
et L
LCPa
trio
t Ren
ewab
le F
uel
Ntl.
Biod
iese
l Boa
rdAd
vanc
ed B
iofu
els
Assn
.
Dav
id L
oebs
ack
(D-I
A)
Rene
wab
le F
uels
Shel
l Oil
Com
pany
BP A
mer
ica
Exxo
n M
obil
Corp
.Ch
evro
n US
AM
arat
hon
Petr
oleu
m
J. Br
idge
nstin
e (R
-OK)
Don
You
ng (R
-AK)
Oil
Ucor
e Ra
re M
etal
sEl
ectr
on E
nerg
yGr
eat W
este
rn T
ech.
Thom
as a
nd S
kinn
erLy
nas
Corp
.
N/A
Rar
e Ea
rth
Met
als
Duk
e En
ergy
NV E
nerg
ySo
uthe
rn C
ompa
nyEd
ison
Ele
ctric
Inst
itute
Natio
nal G
ridPu
blic
Ser
vice
Ent
erpr
ise
Grou
p (P
SEG)
Bill
Shus
ter (
R-PA
)M
ike
Mul
vane
y (R
-SC)
Ben
Luja
n (D
-NM
)Ba
rbar
a Bo
xer (
D-C
A)Se
an M
alon
ey (D
-NY)
Uti
litie
s &
Hol
ding
s
Allia
nce
Pipe
line
Amer
ican
Nat
ural
Gas
Allia
nce
Spec
tra
Ener
gyUn
ited
Phos
phor
usCe
nter
Poin
t Ene
rgy
Will
iam
s Co
mpa
nies
Hear
th, P
atio
, and
Barb
ecue
Ass
n.
Mik
e Po
mpe
o (R
-KS)
Tom
Mar
ino
(R-P
A)Fr
ed U
pton
(R-M
I)Jo
hn W
alsh
(D-M
T)
Nat
ural
Gas
Amer
ican
Riv
ers
Cova
nta
Ener
gyNo
rthw
est R
iver
Par
tner
sNa
tiona
l Wat
er R
esou
rces
Assn
.Nt
l. Hy
drop
ower
Ass
n.
John
Bar
rass
o (R
-WY)
Scot
t Tip
ton
(R-C
O)
Orr
in H
atch
(R-U
T)Jo
n Te
ster
(D-M
T)Jo
hn L
arso
n (D
-CT)
Hyd
ropo
wer
Arch
Coa
lPe
abod
y En
ergy
Unite
d M
ine
Wor
kers
of
Amer
ica
New
mon
t Min
ing
Co.
Bitu
min
ous
Coal
Ope
rato
rs A
ssn.
J. Ro
ckef
elle
r (D
-WV)
Heid
i Hei
tkam
p (D
-ND
)Ni
ck R
ahal
l (D
-WV)
Coa
l & M
inin
gUS
Gre
en B
uild
ing
Coun
cil
Hone
ywel
l Int
l.Nt
l. As
sn. o
f Hom
eBu
ilder
sW
indo
w a
nd D
oor
Man
ufac
turin
g As
sn.
Air C
ondi
tioni
ng,
Heat
ing,
and
Re
frig
erat
ion
Inst
.
Dav
id M
cKin
ley
(R-W
V)Ro
bert
Lat
ta (R
-OH)
M. B
lack
burn
(R-T
N)Je
anne
Sha
heen
(D-N
H)M
ark
War
ner (
D-V
A)
Con
stru
ctio
n &
Rea
l Est
ate
Fig
ure
A.5
:E
stim
ate
dLSNM
Posi
tion
s:E
nerg
yB
ills
.W
ep
rese
nt
the
sam
evis
ual
izat
ion
asin
Fig
ure
3b
ut
for
theLSNM
infe
rred
from
ad
atase
tb
uil
tw
ith
on
lyen
ergy-r
elat
edb
ills
.T
he
sam
ecl
ust
erin
gin
toin
du
stry
-bas
edgr
oup
sar
ises
:in
add
itio
nto
anum
ber
ofm
ean
ingf
ul
ener
gy
sub
-in
du
stri
es(C
oal
,H
yd
rop
ower
,O
il,
Ren
ewab
les,
Uti
liti
es),
we
see
clu
ster
sco
rres
pon
din
gto
oth
erin
du
stri
esh
avin
gan
inte
rest
inen
ergy
legi
slati
on
(Au
tom
oti
ve,
Food
,C
onst
ruct
ion
).W
eal
soob
serv
ea
larg
e-sc
ale
div
isio
nb
etw
een
clu
ster
sp
erta
inin
gto
fuel
and
tran
spor
tati
onon
the
one
han
d,
and
clu
ster
sp
erta
inin
gto
elec
tric
ity
gen
erat
ion
and
dis
trib
uti
onon
the
oth
er(o
nth
eto
pan
db
otto
mof
our
plo
t,re
spec
tivel
y).
35
data {int<lower=2> N_row; // number of rowsint<lower=2> N_col; // number of columnsint<lower=1> D; // dimensionality of latent spaceint<lower=0> edges[N_row, N_col]; // connection strength data
}
transformed data {int flat_ix;int flat_edges[N_row * N_col];
flat_ix = 1;for (i in 1:N_row) {
for (j in 1:N_col) {flat_edges[flat_ix] = edges[i][j];flat_ix = flat_ix + 1;
}}
}
parameters {vector[D] mu_col_embedding;vector<lower=0.05>[D] cov_row_embedding_diag;cov_matrix[D] cov_col_embedding;row_vector[D] row_embedding[N_row];row_vector[D] col_embedding[N_col];
real mu_row_factor;real<lower=0.05> sigma_row_factor;real<lower=0.05> sigma_col_factor;vector[N_row] row_factor;vector[N_col] col_factor;
}
model {int flat_jx;vector[N_row * N_col] flat_log_means;
row_factor ∼ normal(mu_row_factor, sigma_row_factor);col_factor ∼ normal(0.0, sigma_col_factor);
row_embedding ∼ multi_normal(rep_vector(0.0, D), diag_matrix(cov_row_embedding_diag));
col_embedding ∼ multi_normal(mu_col_embedding, cov_col_embedding);
flat_jx = 1;for (i in 1:N_row) {for (j in 1:N_col) {flat_log_means[flat_jx] =row_factor[i] + col_factor[j] - distance(row_embedding[i], col_embedding[j]);
flat_jx = flat_jx + 1;}
}flat_edges ∼ poisson_log(flat_log_means);
}
Listing A.1: Stan code defining a sampler for the posterior distribution of the LSNM. Note that ford = 1 the model is greatly simplified, and it is much more efficient to remove the extra dimensionfrom the types related to the latent space position distribution.
36
−4 −2 0 2 4Latent Space Dimension 1
−4
−3
−2
−1
0
1
2
3
4
5
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Latent Space Positions: Politicians
−4 −2 0 2 4Latent Space Dimension 1
−4
−3
−2
−1
0
1
2
3
4
5
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Latent Space Positions: Lobbying Clients
Figure A.6: Latent Space Position Uncertainties. One of the cardinal advantages of theBayesian formulation of our latent space model is that it allows us to investigate the posteriordistribution beyond a point estimate of the parameters. We illustrate here an estimate of the“typical set” of each agent’s latent space position, an ellipse containing most of the samples drawnof their position by our MCMC algorithm. The computations giving the parameters of these ellipsesare detailed in Section A.3.3. The uncertainties are visibly small enough that we may surmise theclusters drawn in Figure 3 to be “stable”, in the sense that similar clusters would appear in atypical draw from the posterior distribution.
Popularity Factor Variances For popularity factors αi and βj , it is more convenient to consider
the variance of the estimates of the exponentials exp(αi) and exp(βj): first, these are non-negative
and so their statistics are easier to understand, and second they are the quantities we visualize
and are ultimately more interested in, since the interaction mean scales linearly in each. Thus, we
compute the standard error (standard deviation divided by mean) of each of these quantities. As
we did for the position variances, since there are many popularity factors and we analyze them in
this work only at a coarse-grained level, we prefer to visualize all of the standard errors at once,
giving in Figure A.7 their histograms for the main dataset. We observe that the distributions of the
standard errors are strongly concentrated on the low end; manual inspection reveals that the agents
with the highest standard errors are those with the lowest degrees, and so are less significant to
our substantive analysis anyway (these agents also tend to have smaller popularity factors, which
can further inflate the standard error). In both the latent positions and popularity factors, we see
that the politician parameters have more uncertainty than the lobbying client parameters, which is
consistent with our model finding that the latent space structure is imposed primarily by lobbying
client industry groupings.
37
0.1 0.2 0.3 0.4Standard Error of exp(αi)
0
25
50
75
100
125
150
Nu
mb
ero
fL
ob
byin
gC
lien
ts
0.0 0.2 0.4 0.6 0.8Standard Error of exp(βj)
Nu
mb
ero
fP
olit
icia
ns
Figure A.7: Latent Space Model Popularity Factor Uncertainties. Distributions of thestandard errors (standard deviation as a fraction of the mean) of the sampled popularity factorsfor all lobbying clients and all politicians.
Trace Plots In Figure A.8, we show two representative sets of trace plots for latent space position
and popularity factor parameters for one lobbying client and one politician. Generally, and as is
visible in these particular examples, the marginal distributions for lobbying client parameters mix
more rapidly than those of politician parameters, and the marginals for popularity factors mix
more rapidly than those of latent positions.
A.3.4 Identifiability in Multidimensional Log-Quadratic Latent Space Models
Latent space models that measure distance between latent positions with the Euclidean distance
‖θi−ψj‖ typically have means depending (after a suitable logarithmic or sigmoidal transformation
for Poisson or Bernoulli distributions for the incidence matrix entries) linearly on either ‖θi −ψj‖or ‖θi − ψj‖2 (we call these log-linear and log-quadratic latent space models, respectively). We
chose the former dependence for our model after finding that it gave a latent space that was better-
behaved numerically and more interpretable. In this section, we give a brief heuristic mathematical
argument and some numerical evidence on our datasets for why this might be the case.
We suggest that the issue may arise from a new invariance in the log-quadratic LSNM, which
only appears in models with ambient dimension d > 1 (explaining why, as far as we know, it has
not been observed previously). The problem of indentifiability (or “aliasing”) as it arises in our
model is that the log-mean of the Poisson distributions we assign to Ai,j , given by
logµi,j = αi + βj + ‖θi −ψj‖, (13)
38
0.5
0.0
0.5
Politician: Dimension 1
1.0
0.5
Client: Dimension 1
1.0
0.5
0.0Politician: Dimension 2
0.5
1.0
Client: Dimension 2
0 200 400 600Iteration (Every 10)
1.0
0.5
Politician: Popularity Factor
0 200 400 600Iteration (Every 10)
0.4
0.6
0.8
Client: Popularity Factor
Figure A.8: Latent Space Model Trace Plots. We show trace plots for the model parameters(two latent space dimensions and one popularity factor) drawn from two (out of four) MCMCchains, for one client, Intel Corporation, and one politician, Rand Paul (R-KY).
admits certain transformations of (α,β,θ,ψ) that do not change the mean, and therefore do not
change the modeled distribution. This can create multimodal or ridged posterior distributions,
making sampling more difficult, so we would like to add additional assumptions (in Bayesian data
analysis this is usually done by adding “more informative” priors) to remove these symmetries.
Two classes of symmetries are easy to see: any constant can be added to all αi and subtracted
from all βj , and any translation, rotation, or reflection can be applied to both θi and ψj . These
identifiability issues are well-understood in the literature, and are addressed specifically for our
model in the main text.
But when we instead consider means νi,j satisfying
log νi,j = αi + βj + ‖θi −ψj‖2, (14)
we believe that a more subtle identifiability problem arises, which depends on all four of the
39
parameters together. We expand the distance term, obtaining
log νi,j = αi + βj +d∑`=1
(θi` − ψj`)2 =
(αi +
d∑`=1
θ2i`
)+
(βj +
d∑`=1
ψ2j`
)− 2
d∑`=1
θi`ψj` , (15)
and consider a family of transformations parametrized by d non-zero constants K1, . . . ,Kd, and
taking the form
θi` 7→ K`θi` for ` ∈ [d] (16)
ψj` 7→1
K`ψj` for ` ∈ [d] (17)
αi 7→ αi +
d∑`=1
(1−K2` )θ2i` (18)
βj 7→ βj +
d∑`=1
(1− 1
K2`
)ψ2j`
(19)
The idea is that the first two mappings ensure that the last term of (15) is unchanged, while the
last two adjust α and β so that the contributions due to the first two mappings are cancelled.
When d = 1, since there is just a single parameter K1, the αi either all increase or all decrease,
and the βj do the opposite, so if, as mentioned above, we have already constrained these parameters
to not admit translations, then this transformation will only be admissible if K1 = 1, which is the
identity transformation, so there is no identifiability problem. However, when d > 1, we can have
a non-trivial transformation with the αi and βj remaining centered (which we think of as∑m
i=1 αi
and∑n
j=1 βj remaining constant) so long as
0 =
m∑i=1
d∑`=1
(1−K2` )θ2i` =
d∑`=1
(1−K2` )
(m∑i=1
θ2i`
)(20)
0 =
n∑j=1
d∑`=1
(1− 1
K2`
)ψ2j`
=d∑`=1
(1− 1
K2`
) n∑j=1
ψ2j`
(21)
which generically admits solutions with K` not all identically equal to 1 when d > 1. We see
from our definitions (16) and (17) that under this family of transformations, we would expect the
latent positions to move roughly along hyperbolae. In Figure A.11, we verify this prediction with
a numerical experiment on our main dataset.
A.4 Model Comparison
We give a few points of comparison between our latent space and community models, taking the
point of view not of attempting to choose a single “best fit” model, but rather of confirming that
40
the models all consistently reflect the same underlying properties of the lobbying network. There
are two components of the models to compare: first, both the dc-biSBM and the LSNM include
latent variables playing the role of “popularity factors”, which we named in both models αi and
βj for lobbying clients and politicians respectively (though in the LSNM their role differed by
an exponential transform). In Figure A.12, we compare these values and confirm that they are
positively correlated for both politicians and lobbying clients. Second, in the LSNM we assign
latent positions to each agent, while in the community models we divide up the agents into discrete
clusters. This prompts the question of whether the latent positions of the LSNM “respect” the
clusterings suggested by the community models, i.e., of whether the communities identified the
block models are localized in the latent space. We address this question in Figure A.14, and
Figure A.15 for the biSBM and dc-biSBM models, respectively, observing that the latter both gives
less weight to high-degree nodes, and is more aligned with the LSNM geometric representation.
Figure A.10 shows the incidence matrix A, with its rows and columns ordered according to
the groupings given by the biSBM (top panel) and the dc-biSBM (bottom panel). Both models
show that there exist clear “checkerboard” community patterns in the lobbying network data, and,
as mentioned before, without degree correction the biSBM suffers from the flaw of grouping high-
degree nodes together. (Indeed, one of the client clusters found without degree correction consists
of exactly the five lobbying clients with highest degree.) The dc-biSBM, in contrast, gives much
more balanced cluster sizes and less concentration of the highest-degree agents. Degree correction
also yields cluster memberships which, especially on the lobbying client side, correspond more
closely to the clusters we found previously in the LSNM.
Lastly, Figure 5 overlays the clusters identified by dc-biSBM on the geometry obtained from
the latent space model. To facilitate a direct comparison, we show the latent positions of each
individual political actor as well as the subjective grey boundaries that we used to group clients
and politicians. We then form eight paired political communities, each consisting of one lobbying
client community and one politician community. For example, we group all clients in Client Group
1 and all legislators in Politician Group 4 and call them members of “Community 1.” Finally,
we assign different colors and markers to visually distinguish the eight political communities. The
figure shows that the community structures align remarkably well with each other: the boundaries
that we drew indeed correspond to distinct political communities or to mixtures of two or three
political communities (this is especially prevalent near the center of the latent space, which we
observed earlier did not exhibit strong industry-level clustering).
41
2 0 21D Latent Space Position: Lobbying Clients
0
10
20
30
40
50
Num
ber o
f Lob
byin
g Cl
ient
s
InsuranceFinanceEnergyConservationTechnologyTelecom
Figure A.9: We illustrate that some of the clusters we identify in the two-dimensional model appearin the one-dimensional model as well, with some of the same adjacencies between clusters, thoughthere are more “outliers” positioned apart from their clusters.
Polit
ician
s
Bipartite Stochastic Block Model
0
40
80120160
Lobbying Clients
Polit
ician
s
Degree-Corrected Bipartite Stochastic Block Model
0
40
80120160
Figure A.10: Community Model Comparison. Results of the biSBM, and the dc-biSBM on themain dataset, shown by permuting the incidence matrix to group together the clusters that eachalgorithm finds. Both models identify community structure, and, as expected, the dc-biSBM infersmuch more balanced cluster sizes.
42
−4 −2 0 2 4Latent Space Dimension 1
−4
−2
0
2
4
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Positions: Politicians (Linear)
−4 −2 0 2 4Latent Space Dimension 1
−4
−2
0
2
4
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Positions: Lobbying Clients (Linear)
−2 0 2Latent Space Dimension 1
−2
−1
0
1
2
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Positions: Politicians (Quadratic)
−2 0 2Latent Space Dimension 1
−2
−1
0
1
2
Lat
ent
Sp
ace
Dim
ensi
on
2
Typical Positions: Lobbying Clients (Quadratic)
Figure A.11: Identifiability Problems in Log-Quadratic Latent Space Model. We see thattypical latent space positions “stretch” along hyperbolae in latent space models with a Poissonmean depending log-quadratically on the distance between positions, which is much mitigated inthe same model depending log-linearly on the distance.
0.00 0.05 0.10αi Value in dc-biSBM
0
10
20
30
exp
(αi)
Val
ue
inL
SN
M
(ρ = 0.73)
0.0 0.2 0.4βj Value in dc-biSBM
0
20
40
60
exp
(βj)
Val
ue
inL
SN
M
(ρ = 0.63)
Figure A.12: Popularity Factor Comparison. We plot the popularity factors obtained by theLSNM and dc-biSBM, confirming that there is a fairly strong positive correlation between theirvalues. It is most natural to take the exponential of the factors in the LSNM in this comparison,as can be seen from the formal expressions for the models’ distributions in the main text.
43
A.5 Bipartite Link Community Model EM Algorithm
In this appendix, we present a more detailed derivation of the EM update equations for the link
community model. After discarding constant terms, the log-likelihood of this model is given by
log P(A | α,β,κ) =
m∑i=1
n∑j=1
Ai,j log
(k∑z=1
κzαi,zβj,z
)−
m∑i=1
n∑j=1
k∑z=1
κzαi,zβj,z (22)
We then apply the standard technique of introducing parameters qi,j(z) that form a probability
distribution over z for fixed i and j and applying Jensen’s inequality to obtain the objective function
of our optimization task, a lower bound on the log-likelihood:
L(A,α,β,κ, q)df=
m∑i=1
n∑j=1
k∑z=1
(Ai,jqi,j(z) log
(κzαi,zβj,zqi,j(z)
)− κzαi,zβj,z
)(23)
≤ log P(A | α,β,κ).
We then seek to maximize L via coordinate ascent. Maximizing with respect to the qi,j(z) with all
other parameters fixed simply sets these parameters to the values that make Jensen’s inequality
sharp, which are
qi,j(z) =κzαi,zβj,z∑kz=1 κzαi,zβj,z
. (24)
It is also straightforward to differentiate with respect to κz, obtaining the update
κz =
∑mi=1
∑nj=1Ai,jqi,j(z)∑m
i=1
∑nj=1 αi,zβj,z
. (25)
For the α and β parameters, we must constrain our optimization to respect the normalization that
for all z,∑m
i=1 αi,z =∑n
j=1 βj,z = 1. If add to L Lagrange multiplier terms λz(1 −∑m
i=1 αi,z) +
µz(1−∑n
j=1 βj,z), then we obtain the updates
αi,z =
∑nj=1Ai,jqi,j(z)
λz +∑n
j=1 κzβj,z, (26)
βj,z =
∑mi=1Ai,jqi,j(z)
µz +∑m
i=1 κzαi,z, (27)
44
but now we see that to obtain the desired normalization we must in fact take λz and µz such that
they cancel out these denominators and leave us with the simpler
αi,z =
∑nj=1Ai,jqi,j(z)∑m
i=1
∑nj=1Ai,jqi,j(z)
, (28)
βj,z =
∑mi=1Ai,jqi,j(z)∑m
i=1
∑nj=1Ai,jqi,j(z)
, (29)
which in particular do not involve any coupling between αi,z and βj,z, meaning that a single itera-
tion of updates suffices for the M step of our EM algorithm, simplifying the calculation substantially
compared to the nested coordinate ascents that are usually involved in mixed-membership stochas-
tic block models.
A.6 Link Community Null Model Analysis
The configuration model, a standard tool in the theory of random graphs, gives a natural way to
build a null model for our analyses. In this model, the degree distribution of the graph is retained,
but all connections are “rewired” randomly. More precisely, in our bipartite setting, if there are m
lobbying clients indexed 1, . . . ,m and n politicians indexed 1, . . . , n, suppose that lobbying client
i is connected to a total of dLi politicians, and politician j is ocnnected to a total of dPj lobbying
clients. Note that the total number of edges E is given by the sums of either of these quantities:
E =m∑i=1
dLi =n∑j=1
dPj . (30)
Now, we perform the following random process to generate a draw from the configuration model:
1. From each lobbying client i, produce dLi “stubs” or partial edges, which we may label
eLi,1, . . . , eLi,dLi
. Likewise, from each politician j, produce stubs ePj,1, . . . , ePj,dPj
.
2. Generate a uniformly random matching of the two sets {eLi,k} and {ePj,`} (note that by the
preceding remark, the two have equal size.
3. Generate a bipartite graph by putting an edge between lobbying client i and politician j
whenever eLi,k and ePj,` were matched for some k, ` in the previous step (so that the total
number of edges is the total number of pairs k, ` for which these two stubs were matched).
It is simple to verify that the configuration model satisfies the following desirable properties. First,
every draw of the configuration model has the same degree distribution as the original graph; that
is, the collections of numbers {dLi } and {dPj } remain the same. And second, the configuration
model draws a uniform sample from the bipartite graphs having those degree distributions. In this
45
sense, the configuration model is the canonical null model that retains the degree distribution of a
graph, but “forgets” all other details.
To understand how exceptional our findings with the biLCM are, we analyze the link community
distribution entropy under the configuration model for the full 113th Congress dataset here. In
particular, in Figure A.13, we compare the entropy distributions obtained from the biLCM run
on this dataset with averaged distributions obtained from many draws of the configuration model
applied to the same dataset (as we would hope, those distributions concentrate fairly well around
a single representative null entropy distribution). We see that the entropies of the link community
distributions for the actual dataset are much lower, meaning the link community distributions
are much more concentrated, than those for the null model. This gives further evidence that
the communities we find are statistically significant, and not, for instance, merely artifacts of the
graph’s degree distribution (not a farfetched possibility, as less sophisticated community detection
algorithms often suffer from detecting spurious communities consisting only of high-degree nodes).
0 1 2 3Lobbying Client Entropy
0
20
40
60
80
100
Freq
uenc
y
Null Model113th Congress
0 1 2 3Politician Entropy
0
10
20
30
40
50
Figure A.13: Link Community Entropy Distribution vs. Null Model. We plot the distri-bution of entropies of link community distributions obtained by the biLCM run on the full 113thCongress dataset, compared to an average (per histogram bin) of the same distribution for sev-eral draws of the null configuration model generated from the same dataset. We observe thatfor both lobbying clients and politicians, the actual link community distributions are much moreconcentrated than those obtained in the null model, supporting the statistical significance of thecommunities we describe in the main text.
46
5.0
2.5
0.0
2.5
Client Cluster 1 Client Cluster 2 Politician Cluster 1 Politician Cluster 2
5.0
2.5
0.0
2.5
Client Cluster 3 Client Cluster 4 Politician Cluster 3 Politician Cluster 4
5.0
2.5
0.0
2.5
Client Cluster 5 Client Cluster 6 Politician Cluster 5 Politician Cluster 6
2.5 0.0 2.5
5.0
2.5
0.0
2.5
Client Cluster 7
2.5 0.0 2.5
Client Cluster 8
2.5 0.0 2.5
Politician Cluster 7
2.5 0.0 2.5
Politician Cluster 8
Figure A.14: LSNM vs. biSBM. We plot the latent positions of politicians and lobbying clientsfrom the LSNM, split by their memberships in clusters from the biSBM, with both run on themain dataset. For the sake of visual clarity, we no longer vary the sizes of the plotted latent spacepositions based on popularity factors. Localization of groups in the latent space suggests that thelatent space model is somewhat consistent with the biSBM, but much of the fine-grained lobbyingclient clustering we observe in Figure 3 is not reflected in the biSBM results. (The largest clustersbiSBM finds are those with low lobbying and sponsorship activity, and correspond to the mostspread out clusters in these plots.)
47
5.0
2.5
0.0
2.5
Client Cluster 1 Client Cluster 2 Politician Cluster 1 Politician Cluster 2
5.0
2.5
0.0
2.5
Client Cluster 3 Client Cluster 4 Politician Cluster 3 Politician Cluster 4
5.0
2.5
0.0
2.5
Client Cluster 5 Client Cluster 6 Politician Cluster 5 Politician Cluster 6
2.5 0.0 2.5
5.0
2.5
0.0
2.5
Client Cluster 7
2.5 0.0 2.5
Client Cluster 8
2.5 0.0 2.5
Politician Cluster 7
2.5 0.0 2.5
Politician Cluster 8
Figure A.15: LSNM vs. dc-biSBM. We plot the latent positions of politicians and lobbying clientsfrom the LSNM, split by their memberships in clusters from the dc-biSBM, with both run on themain dataset. Almost all clusters found by dc-biSBM are closely localized in the latent space,showing that the dc-biSBM captures much of the same information as the latent space model.
48
A.7 113th Congress Latent Space Model Clusters
The tables below list all client and politician members of each of the clusters shown in Figure 3.
A.7.1 Cluster: Finance
Lobbying Client NameAmerican Bankers Assn. Securities Assn.American Land Title Assn.American Council of Life InsurersBank of America Corp.Capital One Financial Corp.Center for Responsible LendingCitigroup Management Corp.Clearing House Payments Co. LLCCredit Union Ntl. Assn.Community Associations InstituteCommunity Bankers Assn. of IllinoisCompass BankDeutsche Bank Securities, Inc.Fifth Third BancorpFinancial Services RoundtableGenworth Financial, Inc.Independent Community Bankers of AmericaInvestment Company InstituteIntl. Swaps and Derivatives Assn.Massachusetts Mutual Life Insurance Co.Mortgage Bankers Assn.MetLife Group, Inc.Morgan StanleyNtl. Assn. of Federal Credit UnionsNew York Bankers Assn.Petroleum Marketers Assn. of AmericaPrincipal Financial GroupRadian Group, Inc.Securities Industry and Financial Markets Assn.State Farm Insurance CompaniesNorthwestern Mutual Life Insurance Co.TIAA-CREFTransamerica CompaniesUnum GroupWells Fargo & Co.
Politician Name State PartyBarr, Garland KY RepublicanCampbell, John CA RepublicanCorker, Bob TN RepublicanDuffy, Sean WI RepublicanFincher, Stephen TN RepublicanGarrett, Scott NJ RepublicanHahn, Janice CA DemocratHultgren, Randy IL RepublicanIsakson, John GA RepublicanJohnson, Tim SD DemocratKing, Peter NY RepublicanLuetkemeyer, Blaine MO RepublicanMcHenry, Patrick NC RepublicanMiller, Gary CA RepublicanNeal, Richard MA DemocratPerlmutter, Ed CO DemocratPittenger, Robert NC RepublicanShelby, Richard AL RepublicanSherman, Brad CA DemocratWagner, Ann MO RepublicanWaters, Maxine CA Democrat
A.7.2 Cluster: Travel
Lobbying Client NameGlobal Business Travel Assn.Mariott International, Inc.Morphotrust USASabre Global, Inc.U.S. Travel Assn.United States Olympic Committee
Politician Name State PartyCoats, Daniel IN RepublicanEmerson, Jo Ann MO RepublicanGowdy, Trey SC RepublicanHeck, Joseph NV RepublicanMiller, Candice MI RepublicanQuigley, Mike IL Democrat
49
A.7.3 Cluster: Insurance & Real Estate
Lobbying Client NameACE INA HoldingsAflac, Inc.Allstate Insurance CompanyAmerican Family Mutual Insurance CompanyAmerican Insurance Assn.Assn. for Advanced Life UnderwritingAmerican Council of Life InsurersBuilding Owners and Managers Assn. Intl.Chubb CorporationCincinnati Financial CorporationFarmers Group, Inc.Hartford Financial Services GroupIndependent Insurance Agents & Brokers
of AmericaJ.P. Morgan Chase & CompanyKPMG LLPLiberty Mutual GroupNational Assn. of Insurance and Financial
AdvisorsNational Assn. of Mutual Insurance CompaniesNational Assn. of Professional Insurance AgentsNational Assn. of Real Estate Investment TrustsNationwide Mutual Insurance CompanyProperty Casualty Insurers Assn. of AmericaPrudential Financial, Inc.Real Estate RoundtableState Farm Mutual Automobile Insurance
CompanyThe Travelers Companies, Inc. and SubsidiariesThe Village of Bald Head IslandUnited Services Automobile Assn.Zurich
Politician Name State PartyBrown, Sherrod OH DemocratCapuano, Michael MA DemocratChambliss, Saxby GA RepublicanCollins, Susan ME RepublicanCrawford, Eric AR RepublicanCummings, Elijah MD DemocratDiaz-Balart, Mario FL RepublicanFarenthold, Blake TX RepublicanGrimm, Michael NY RepublicanHuizenga, Bill MI RepublicanHurt, Robert VA RepublicanMenndez, Robert NJ DemocratNeugebauer, Randy TX RepublicanPalazzo, Steven MS RepublicanRichmond, Cedric LA DemocratRoss, Dennis FL RepublicanSires, Albio NJ DemocratStivers, Steve OH RepublicanThompson, Bennie MS DemocratWilson, Frederica FL Democrat
A.7.4 Cluster: Universities
Lobbying Client NameAircraft Owners & Pilots Assn.Assn. of Science-Technology Centers, Inc.Boston UniversityCalifornia Institute of TechnologyMassachusetts Institute of TechnologyMcGraw Hill EducationNtl. Business Aviation Assn.Northeastern UniversityNorthwestern UniversityOhio State UniversityUniversity of CincinattiUniversity of IllinoisUniversity of RochesterUniversity of Southern CaliforniaUniversity of VirginiaVanderbilt University
Politician Name State PartyBachus, Spencer AL RepublicanCollins, Chris NY RepublicanDoyle, Michael PA DemocratFoxx, Virginia NC RepublicanJohnson, Henry GA DemocratManchin, Joe WV DemocratMesser, Luke IN RepublicanReed, Tom NY RepublicanReichert, David WA RepublicanRokita, Todd IN Republican
50
A.7.5 Cluster: Technology
Lobbying Client NameActavisAmazon.comAmazon Corporate LLCAmerican Assn. of Law LibrariesAmerican Express CompanyAmerican Intellectual Property Law Assn.American Mushroom InstituteAssn. of National Advertisers, Inc.Business RoundtableBusiness Software AllianceCompTIA Member Services, LLCConsumer Electronics Assn.Dell, Inc.Digital 4thDirect Marketing Assocation, Inc.Eastman Kodak CompanyEbay, Inc.EMC CorporationExperian, Inc.Facebook, Inc.Google, Inc.Hewlett-Packard CompanyIntel CorporationInteractive Advertising BureauIntl. Council of Shopping Cneters Intuit, Inc.LinkedIn CorporationMagazine Publishers of AmericaMastercardMcAfee, Inc.Microsoft CorporationNational Venture Capital Assn.Oracle CorporationPfizer, Inc.Qualcomm, Inc.Semiconductor Industry Assn.Software & Information Industry Assn.Texas Instruments, Inc.The Internet AssocationVisa, Inc.Yahoo!, Inc.
Politician Name State PartyBarton, Joe TX RepublicanBrooks, Susan IN RepublicanCarper, Thomas DE DemocratCarter, John TX RepublicanCastor, Kathy FL DemocratChaffetz, Jason UT RepublicanCoble, Howard NC RepublicanCornyn, John TX RepublicanDeFazio, Peter OR DemocratDeutch, Theodore FL DemocratEnzi, Michael WY RepublicanHatch, Orrin UT RepublicanHeck, Denny WA DemocratHolding, George NC RepublicanIssa, Darrell CA RepublicanJeffries, Hakeem NY DemocratKaptur, Marcy OH DemocratLeahy, Patrick VT DemocratLofgren, Zoe CA DemocratMcCaskill, Claire MO DemocratSalmon, Matt AZ RepublicanSchumer, Charles NY DemocratShea-Porter, Carol NH DemocratToomey, Patrick PA RepublicanWomack, Steve AR RepublicanYoder, Kevin KS Republican
A.7.6 Cluster: Gun Rights
Lobbying Client NameBoyd Gaming CorporationBridgepoint EducationEverytown for Gun Safety Action FundGun Owners of America, Inc.Ntl. Assn. for Gun RightsNtl. Rifle Assn. of America
Politician Name State PartyDavis, Danny IL DemocratJordan, Jim OH RepublicanKelly, Robin IL DemocratLowenthal, Alan CA DemocratMcCarthy, Carolyn NY DemocratStockman, Steve TX Republican
51
A.7.7 Cluster: Telecom
Lobbying Client NameAmerican Cable Assn. , Inc.Americans for Tax ReformAT&T CorporationAT&T Services, Inc. and its AffiliatesCBS CorporationCincinnati BellComcast CorporationCompetitive Carriers Assn.Cox Enterprises, Inc.CTIA: The Wireless Assn.Endo Pharmaceuticals, Inc.IHeartMedia, Inc.Loews CorporationMultistate Tax CommissionNational Assn. of BroadcastersNational Cable & Telecommunications Assn.National Telecommunications Cooperative Assn.Time Warner Cable, Inc.Twenty-First Century Fox, Inc.TwinLogic StrategiesUnited States Telecom Assn.Verizon
Politician Name State PartyAmash, Justin MI RepublicanAyotte, Kelly NH RepublicanChabot, Steve OH RepublicanCollins, Doug GA RepublicanConaway, K. TX RepublicanEshoo, Anna CA DemocratGoodlatte, Bob VA RepublicanHeller, Dean NV RepublicanKeating, William MA DemocratLatta, Robert OH RepublicanMatsui, Doris CA DemocratMcCain, John AZ RepublicanMcCaul, Michael TX RepublicanMeng, Grace NY DemocratNunes, Devin CA RepublicanRockefeller, John WV DemocratRogers, Mike MI RepublicanRush, Bobby IL DemocratWalden, Greg OR RepublicanWasserman Schultz, Debbie FL DemocratWatt, Melvin NC DemocratWyden, Ron OR Democrat
A.7.8 Cluster: Military, Veterans’ Affairs
Lobbying Client NameAmerican LegionDisabled American VeteransFleet Reserve Assn.Iraq and Afghanistan Veterans of America, Inc.Military Officers’ Assn. of AmericaParalyzed Veterans of AmericaRetired Enlisted AssocationUnited Spinal Assn.
Politician Name State PartyBishop, Sanford GA DemocratBoozman, John AR RepublicanBrownley, Julia CA DemocratBurr, Richard NC RepublicanButterfield, George NC DemocratCoffman, Mike CO RepublicanGallego, Pete TX DemocratGutirrez, Luis IL DemocratKirkpatrick, Ann AZ DemocratLangevin, James RI DemocratLarsen, Rick WA DemocratMcCarthy, Kevin CA RepublicanMiller, Jeff FL RepublicanNegrete McLeod, Gloria CA DemocratO’Rourke, Beto TX DemocratPerry, Scott PA RepublicanPingree, Chellie ME DemocratRuiz, Raul CA DemocratRunyan, Jon NJ RepublicanSinema, Kyrsten AZ DemocratTakano, Mark CA DemocratTitus, Dina NV DemocratTsongas, Niki MA DemocratWalorski, Jackie IN RepublicanWenstrup, Brad OH Republican
52
A.7.9 Cluster: Environmentalism
Lobbying Client NameAmerican Beverage Assn.American Iron and Steel InstituteAmerican Motorcyclist Assn.American RiversArkema, Inc.Assn. of California Water AgenciesBlue Green AllianceBunge North America, Inc.Council on Federal Procurement of Architectural
Engineering ServiceEarthjustice Legal Defense FundEnvironmental Working GroupGrocery Manufacturers Assn.International Code CouncilJackson County MississippiJDRF Intl.Las Vegas Valley Water DistrictLeague of Conservation VotersManagement Assn. for Private Photogrammetric
SurveyorsMetropolitan Water District of Southern
CaliforniaNtl. Corn Growers Assn.Ntl. Right to Work CommitteeNtl. Society of Professional SurveyorsNtl. Wildlife FederationNature ConservancyNorthern California Power AgencyOcean ConservancyOpen Space InstitutePublic Power CouncilSafari Club Intl.Sierra ClubSociety of the Plastics Industry, Inc.Solano CountySouthern Environmental Law CenterSouthern Nevada Water AuthorityThe Wilderness SocietyTransportation Communications Intl. UnionTrust for Public LandsUnited Phosphorus, Inc.Waggoner Engineering, Inc.Waterways Council, Inc.
Politician Name State PartyBishop, Rob UT RepublicanBishop, Timothy NY DemocratBooker, Cory NJ DemocratBordallo, Madeleine GU DemocratBoxer, Barbara CA DemocratCalvert, Ken CA RepublicanCasey, Robert PA DemocratCosta, Jim CA DemocratCrapo, Michael ID RepublicanDaines, Steve MT RepublicanDavis, Rodney IL RepublicanDeGette, Diana CO DemocratFarr, Sam CA DemocratFleming, John LA RepublicanGaramendi, John CA DemocratGibbs, Bob OH RepublicanGosar, Paul AZ RepublicanHagan, Kay NC DemocratHastings, Alcee FL DemocratHeinrich, Martin NM DemocratHorsford, Steven NV DemocratHuffman, Jared CA DemocratJackson Lee, Sheila TX DemocratKind, Ron WI DemocratKinzinger, Adam IL RepublicanLautenberg, Frank NJ DemocratLujn, Ben NM DemocratMaloney, Sean NY DemocratMcClintock, Tom CA RepublicanMurkowski, Lisa AK RepublicanPaul, Rand KY RepublicanRuppersberger, C. MD DemocratSchweikert, David AZ RepublicanScott, Austin GA RepublicanThompson, Mike CA DemocratTipton, Scott CO RepublicanUdall, Tom NM DemocratValadao, David CA RepublicanVela, Filemon TX DemocratVitter, David LA RepublicanWilliams, Roger TX RepublicanWittman, Robert VA RepublicanYoung, Don AK Republican
53
A.7.10 Cluster: Energy
Lobbying Client NameAmerican Gas Assn.American Municipal Power, Inc.American Natural Gas Alliance, Inc.American Petroleum InstituteAmerican Public Power Assn.Arch CoalBerkshire Hathaway EnergyBiotechnology Industry Organization (BIO)Black Hills Corp.BP America, Inc.Centerpoint Energy, Inc.Chesapeake Energy Corp.Chevron USA, Inc.Clean Energy Fuels Corp.CMS Energy Corp.Consolidated Edison Co. of New YorkCovanta Energy Corp.DominionDTE EnergyDuke Energy Corp.Edison Electric InstituteEnergy Future HoldingsEntergy Services, Inc.Exelon Business Services LLCExxon Mobil Corp.FirstEnergy Corp.Ford Motor CompanyGreat Plains EnergyGrowth Energy, Inc.Hannon ArmstrongHollyFrontier CorporationHyundai Motor CompanyIndependent Petroleum Assn. of AmericaIntegrys Energy Group, Inc.Koch Companies Public Sector, LLCMarathon Oil CorporationMarathon Petroleum CorporationMissouri River Energy ServicesNtl. Corn Growers Assn.Ntl. Fisheries InstituteNtl. Grid USANtl. Ground Water Assn.Ntl. Mining Assn.Ntl. Propane Gas Assn.Nextera Energy, Inc.Nisource, Inc.NV EnergyOhio Municipal Electric Assn.Olin CorporationPacific Gas and Electric Co.
Politician Name State PartyAlexander, Rodney LA RepublicanBarrasso, John WY RepublicanBennet, Michael CO DemocratBridenstine, Jim OK RepublicanCapito, Shelley WV RepublicanClarke, Yvette NY DemocratClay, Wm. MO DemocratCoons, Chris DE DemocratCramer, Kevin ND RepublicanDuncan, Jeff SC RepublicanEngel, Eliot NY DemocratEnyart, William IL DemocratFlake, Jeff AZ RepublicanFlores, Bill TX RepublicanFranks, Trent AZ RepublicanGardner, Cory CO RepublicanGohmert, Louie TX RepublicanGreen, Gene TX DemocratHarper, Gregg MS RepublicanHeitkamp, Heidi ND DemocratHoeven, John ND RepublicanInhofe, James OK RepublicanJohanns, Mike NE RepublicanLaMalfa, Doug CA RepublicanLamborn, Doug CO RepublicanLankford, James OK RepublicanLarson, John CT DemocratLoebsack, David IA DemocratMarchant, Kenny TX RepublicanMarkey, Edward MA DemocratMatheson, Jim UT DemocratMcAllister, Vance LA RepublicanMcConnell, Mitch KY RepublicanMcKinley, David WV RepublicanMeehan, Patrick PA RepublicanMica, John FL RepublicanMullin, Markwayne OK RepublicanOlson, Pete TX RepublicanPayne, Donald NJ DemocratPeters, Gary MI DemocratPeters, Scott CA DemocratPompeo, Mike KS RepublicanRahall, Nick WV DemocratRothfus, Keith PA RepublicanScalise, Steve LA RepublicanSensenbrenner, F. WI RepublicanSerrano, Jos NY DemocratShaheen, Jeanne NH DemocratShimkus, John IL RepublicanSoutherland, Steve FL Republican
54
Peabody EnergyPeabody Investments Corp.Pepco Holdings, Inc.Pinnacle West Capital Corp.Poet, LLCPPL CorporationPublic Service Enterprise Group (PSEG)Puget Sound EnergyRenewable Fuels Assn.Salt River ProjectSempra EnergySheet Metal & Air Conditioning Contractors
Ntl. Assn.Shell Oil CompanySolar Energy Industries Assn.Southern CompanyThe Fertilizer InstituteUIL Holdings CorporationVectren CorporationXcel Energy, Inc.
Stewart, Chris UT RepublicanThornberry, Mac TX RepublicanTurner, Michael OH RepublicanUdall, Mark CO DemocratWaxman, Henry CA DemocratWelch, Peter VT DemocratWhitfield, Ed KY RepublicanWicker, Roger MS RepublicanYarmuth, John KY Democrat
55
A.7.11 Cluster: Health
Lobbying Client NameAARPAlzheimer’s Assn.American Academy of Dermatology Assn.American Academy of OphthalmologyAmerican Academy of OtolaryngologyAmerican Assn. for JusticeAmerican Assn. of Neurological SurgeonsAmerican Assn. of Nurse AnesthetistsAmerican Bar Assn.American Chiropractic Assn.American College of Emergency PhysiciansAmerican College of PhysiciansAmerican College of RheumatologyAmerican College of SurgeonsAmerican College of Surgeons Professional
Assn.American Fed. of State, County, and
Municipal EmployeesAmerican Health Care Assn.American Hospital Assn.American Medical Assn.American Optometric Assn.American Physical Therapy Assn.American Society of AnesthesiologistsAmerican Society of Interventional Pain
PhysiciansAmerica’s Essential HospitalsAssn. of American Medical CollegesChildren’s Healthcare of AtlantaCity of BaltimoreCoalition of Boston Teaching HospitalsCongress of Neurological SurgeonsCornell UniversityCubist PharmaceuticalsFederation of American HospitalsGreater New York Hospital Assn.Healthcare Leadership CouncilHospital & Healthsystem Assn. of
PennsylvaniaIllinois Hopsital Assn.Indiana University Health, Inc.International Brotherhood of TeamstersIowa Hospital Assn.Jewish Federation of Metropolitan ChicagoMassachusetts Medical SocietyMichigan State UniversityNtl. Active and Retired Federal Employees
Assn.Ntl. Air Traffic Controllers Assn.National Assn. of Psychiatric Health Systems
Politician Name State PartyBera, Ami CA DemocratBlack, Diane TN RepublicanBlunt, Roy MO RepublicanBraley, Bruce IA DemocratBucshon, Larry IN RepublicanBurgess, Michael TX RepublicanClark, Katherine MA DemocratCoburn, Thomas OK RepublicanCotton, Tom AR RepublicanCourtney, Joe CT DemocratCrowley, Joseph NY DemocratDeSantis, Ron FL RepublicanDent, Charles PA RepublicanEdwards, Donna MD DemocratGerlach, Jim PA RepublicanGingrey, Phil GA RepublicanGranger, Kay TX RepublicanGraves, Sam MO RepublicanHarris, Andy MD RepublicanHirono, Mazie HI DemocratJenkins, Lynn KS RepublicanJohnson, Sam TX RepublicanKildee, Daniel MI DemocratKingston, Jack GA RepublicanPascrell, Bill NJ DemocratPierluisi, Pedro PR DemocratPitts, Joseph PA RepublicanPrice, Tom GA RepublicanRenacci, James OH RepublicanRoskam, Peter IL RepublicanSchock, Aaron IL RepublicanSchwartz, Allyson PA DemocratSessions, Pete TX RepublicanSmith, Adrian NE RepublicanTierney, John MA DemocratWalberg, Tim MI Republican
56
National Assn. of Pediatric NursePractitioners
National Fed. of Federal EmployeesNational Treasury Employees UnionNebraska Hospital Assn.NetworkNorth Shore - Long Island Jewish Health
SystemNumbersUSA ActionOchsner Clinic FoundationPartners Healthcare, Inc.Professional Aviation Safety SpecialistsRutgers: The State University of New JerseySusan B. Anthony ListTenet Healthcare CorporationThe Ickes & Enright Group, Inc.The Trustees of Purdue UniversityTufts UniversityTulane UniversityUnited Parcel ServiceUniversity of CaliforniaUniversity of Northern IowaUniversity of Wisconsin-MadisonWorld Wildlife Fund, Inc.Yum! Brands, Inc.
57