Generic Entry, Pay-for-Delay Settlements, and the
Distribution of Surplus in the US Pharmaceutical Industry
Ruben Jacobo-Rubio∗
John L. Turner†
Jonathan W. Williams‡
Using an event study approach, and unique data on Paragraph (iv) pharmaceutical patent litigation decisions, we estimate that brand firms value deterrence at $4.6 billion on average while generic entrants value the right to enter, on average, at $236.8 million. These estimates account for probabilistic district court decisions and an appellate process. In 2002, the Schering-Plough vs. FTC decision led to a surge in "pay-for-delay" settlements. We estimate that surpluses at stake in decided cases are 73% lower after this decision, reducing the direct (per-case) consumer surplus gains anticipated by the 1984 Hatch-Waxman Act's procedures for early generic entry.
JEL Code: L51, I10, I18, K23.
Keywords: Paragraph (iv), generic entry, deterrence, event study, patent litigation, pay-for-delay.
∗U.S. Food and Drug Administration. This research does not reflect the
views of the FDA.
†Department of Economics, University of Georgia.
‡Department of Economics, University of North Carolina - Chapel Hill.
The 1984 Hatch-Waxman Act attempts to strike a balance between promoting innovation
of new brand drugs (to enhance dynamic efficiency) and facilitating generic entry (to enhance
allocative efficiency) in the United States. One key provision, the Paragraph (iv) Abbreviated
New Drug Application (ANDA) certification process, uses patent litigation to help strike this
balance. Specifically, the US Food and Drug Administration permits generic firms to rely on
brand-firm data on safety and efficacy in seeking approval to sell copies of brand drugs, but
does not grant entry unless and until the generic firm successfully challenges all brand-firm
patents covering the active ingredient and formulations of the drug in question. As a reward,
or "bounty," the first generic firm to file an ANDA and win its patent challenge
receives a 180-day marketing exclusivity upon approval of its ANDA.
Effectively, the Paragraph (iv) ANDA process seeks an average level of competition, where
entry occurs sooner against weak patents that do not hold up in court and later against strong
patents that do hold up in court. In recent years, however, brand and generic firms have
increasingly settled Paragraph (iv) litigation out of court (FTC 2010). In some settlements,
brand firms pay generic firms to delay generic entry. Such settlements are potentially anti-
competitive (Shapiro 2003; Bulow 2004), suggesting that incentives may have drifted from
the balance sought by the Hatch-Waxman Act.
We develop a novel framework to estimate the size of the stakes in Paragraph (iv) disputes
for brand and generic firms. Specifically, we use an event study of 93 patent infringement
suits during 1988-2012 to produce statistics on changes in publicly-traded brand and generic
firms' values following district court decisions. Separately, we estimate ex ante probabilities
of district court wins and losses and appellate reversals. We then adjust the estimates from
the event study with multipliers based on the ex ante probabilities, to recover values of
deterrence (for brands) and values of entry (for generics).
With these estimates, and a theoretical model of litigation, we illuminate several policy-
relevant phenomena. First, we find that brand firms value deterring entry, on average, at
about $4.6 billion. In contrast, generic firms value the right to enter at about $236.8 million
(all values in 2010 dollars). The strongly asymmetric stakes in Paragraph (iv) cases
highlight the massive relative payoff to being a monopolist in drug markets, and the potential
gains that firms may reap by restricting competition.
The value of entry represents the minimum payment that a generic firm, certain to win
its Paragraph (iv) case, would accept to delay entry until the brand firm's patents expire. It
is also an upper bound for the value of the 180-day exclusivity.1 In a settlement stipulating
that the generic firm enters prior to patent expiry and retains the exclusivity, the generic
would gain (roughly) the value of entry times the probability that it would lose the case,
about $132 million on average for the cases in our data.
Second, if firms bargain to a settlement prior to litigation and agree to terms delaying
entry for as long as possible, we estimate the average bargaining surplus to be just under
$2 billion per Paragraph (iv) case in our data. Given ordinary assumptions about demand
for drugs, this is a lower bound for the additional consumer surplus realized by permitting
patent challenges under the Paragraph (iv) ANDA process, versus blocking entry for the life
of the brand firm's patents.2 Hence, this number indexes what the Paragraph (iv) ANDA
process gains, in allocative efficiency, by using patent litigation to strike its balance.
Third, to contextualize our estimates and provide a check on their reasonableness, we
regress the size of the (estimated) stakes on recent (pre-litigation) sales of the relevant
brand drug. We find that one dollar of additional yearly brand sales increases the value of
deterrence by about $7.19 and increases the value of entry by about $0.19. The projected
value of deterrence equals slightly more than recent sales times remaining patent life. The
projected value of entry, roughly 40% of 180 days' sales, closely resembles a 180-day
Cournot duopoly payoff.
1The opinion in the 2013 FTC v. Actavis case noted that in 2006 the Generic Pharmaceutical Association
said that the "vast majority of potential profits for a generic drug manufacturer materialize during the 180-day exclusivity period." [FTC v. Actavis 570 US at 4].
2For perfectly inelastic demand, it is exactly the additional consumer surplus earned. For such demand,
quantity sold does not change when price falls, so the lower prices after a successful challenge would yield a transfer of surplus from firms to consumers, but would not create any additional surplus.
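The "roughly 40% of 180 days' sales" figure can be checked with a short calculation. This is a minimal sketch rather than part of the estimation: it assumes a 365-day year and reads the $0.19 coefficient as the value of entry per dollar of yearly brand sales; the variable names are illustrative.

```python
# Estimated increase in the value of entry per dollar of yearly brand sales (from the text).
ENTRY_VALUE_PER_SALES_DOLLAR = 0.19
EXCLUSIVITY_DAYS = 180
DAYS_PER_YEAR = 365  # assumption: a standard calendar year

# Fraction of one year's sales that falls inside the 180-day window.
window_share_of_year = EXCLUSIVITY_DAYS / DAYS_PER_YEAR

# Value of entry expressed as a share of sales during the 180-day window.
share_of_window_sales = ENTRY_VALUE_PER_SALES_DOLLAR / window_share_of_year

print(f"{share_of_window_sales:.1%}")  # about 38.5%
```

Under these assumptions the projected value of entry is a little under two-fifths of brand sales over 180 days, in line with the rounded figure quoted above.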
Finally, we offer evidence that pay-for-delay settlements reduce the allocative efficiency
provided by the Paragraph (iv) process. After the closely-watched 2002 decision in Schering-
Plough v. FTC,3 which upheld a pay-for-delay settlement, there was a surge in such set-
tlements (FTC 2010). From a welfare standpoint, these settlements are of concern because
firms stand to gain the most from settling precisely when the brand firm is most likely to lose
the Paragraph (iv) case. If the post-Schering-Plough environment led to strong selection of
such cases out of litigation and into settlements, then we expect Paragraph (iv) cases that
do proceed to trial to have more frequent brand victories and lower stakes.
Our estimates confirm this intuition. In Paragraph (iv) litigation decisions after Schering-
Plough, the ex ante probability of an ultimate brand victory is 60%, versus just 40% for prior
cases. The average value of deterrence falls 60% after Schering-Plough, from about $8.8
billion to about $3.5 billion, while the average value of entry falls nearly 67%, from $532.0
million to $173.5 million. The average bargaining surplus falls from about $4.9 billion to
about $1.3 billion, a 73% decrease. Hence, pay-for-delay settlements appear to lower the
average allocative-efficiency surplus delivered by Paragraph (iv) litigation.
Our results may have implications for how courts apply antitrust analysis to settlements.
In addressing a 2013 split among Circuit Courts over whether pay-for-delay settlements
are anticompetitive, the Supreme Court found in FTC vs. Actavis et al. (133 S. Ct. 2223
[2013]) that courts should apply a "rule of reason" when a settlement includes a "large and
otherwise unexplained" payment from the brand to the generic. Our estimates of the value
of entry show that settlements where the generic firm retains the 180-day exclusivity will
often confer value to the generic that is "large," relative to typical litigation costs. And
unlike cash payments, the value of retained exclusivity depends upon the strength of the
patents. This suggests that proper antitrust analysis under the Actavis rule may need to
consider outcomes of hypothetical patent litigation.
3The Administrative Law Judge's decision (40 LEXIS 244 [FTC 2002]) upheld the settlement, and the
11th Circuit Court of Appeals eventually upheld it as well (402 F.3d 1056 [11th Cir. 2005]).
While a number of scholars have debated the anti-competitive nature of pay-for-delay
settlements,4 there has been little empirical work. In one recent exception, Drake et al.
(2014) study announcements of settlement of Paragraph (iv) patent litigation, and capture
variables indicating whether the settlement was of the pay-for-delay variety. They find brand
firm value rises an average of 6% upon executing a settlement involving a payment from the
brand to the generic, but no increase at all for settlements without such a payment. Similarly,
McGuire et al. (forthcoming) argue that event studies are potentially useful in showing that
pay-for-delay settlements are anticompetitive.
Our results also complement related studies of Paragraph (iv) patent litigation. Using
slightly different sample selection criteria, Panattoni (2011) conducts an event study of 37
brand-firm Paragraph (iv) litigation events during 1984-2007. Like us, she finds large effects
of district court decisions on firm value. However, she does not estimate the value of deterrence
or the effects on generic firms, which permit important insights into the implications
of pay-for-delay settlements. Branstetter et al. (2011) use a nested logit model that relies
upon aggregate sales data and focus on 17 Paragraph (iv) cases in the hypertension market
(1997-2008). In counterfactual analysis, where generic products are excluded, they claim a
static loss to consumers of $92 billion and a gain to brand firms of $14 billion. These results
imply that entry by these 17 drugs yields a net static gain to society of $78 billion.
This paper also contributes to the literature estimating patent values. For the drugs
in our data, our estimates imply that ironclad versions of the relevant patents would be
worth an average of $2.5 billion at the time of the Paragraph (iv) decision. This estimate
is important because it is a rare window on the characteristics of perhaps the most-valuable
class of patents in the world. It is well known that the distribution of patent values overall
is strongly right-skewed and has a fat right-hand tail (Harhoff, Scherer and Vopel 2003).
4For arguments that pay-for-delay settlements are both harmful and are unanticipated by the Hatch-Waxman
Act, see Hovenkamp et al. (2003), Hemphill (2006, 2009), Elhauge and Krueger (2012) and Edlin et al. (2013). For arguments that pay-for-delay settlements are not necessarily anti-competitive, see Willig and Bigelow (2004), Yu and Chatterji (2011) and Harris et al. (2014).
Moreover, traditional methods for estimating patent values, such as the market-value method
(Bessen 2009) or the renewal method (Schankerman and Pakes 1986; Pakes 1986), have a
difficult time pinning down estimates for the most-valuable patents.
Finally, our work contributes to the literature on market entry. Generally, the lack
of exogenous reasons for the end of status quo monopolies makes it difficult to directly
estimate the value of entry and deterrence. To circumvent this, researchers often make
difficult-to-test behavioral and parametric modeling assumptions, which rely on temporal
or cross-sectional variation in market structure. These models typically take the form of
either a complete-information binary game (Bresnahan and Reiss 1990, 1991; Berry 1992;
Ciliberto and Tamer 2009) or a dynamic Markov-perfect equilibrium framework (Ericson
and Pakes 1995; Gedge et al.
However, in some industries, specific features of
the regulatory environment generate plausibly-exogenous variation that permits more direct
inference (Snider and Williams 2014). In this spirit, our application demonstrates how a
simple event study framework, along with a limited set of assumptions, can be used to
exploit the randomness of patent litigation to infer the value of entry and deterrence and
provide insights into US pharmaceutical firms' incentives to settle disputes.
2. Innovation and Entry in the US Pharmaceutical Industry
For a brand firm, drug development is long and costly. After testing a new molecule
to determine its biological activity (typically in animals), a researcher (often financed by a
pharmaceutical manufacturing firm) files an investigational new drug application (IND) to
start trials in humans. In these clinical trials, the applicant must prove safety and efficacy.5
If successful, the applicant files a New Drug Application (NDA) with the FDA; if the FDA
approves the NDA, the applicant may sell the drug in the US.
5Trials follow a strict, costly three-phase process. See Bradford et al. (2015, section 2.1) for further
Firms pioneering new drugs typically seek patents to cover active ingredients, formulations,
methods of use, devices and processes as they develop these innovations. To approve
a generic version of an NDA, the FDA requests that the generic applicant certify whether or
not active-ingredient and formulation patents could prevent such approval under the Hatch-
Waxman Act (Korn et al. 2009). Specifically, the Hatch-Waxman Act permits generic man-
ufacturers to bypass clinical trials by filing an Abbreviated New Drug Application (ANDA).
But the FDA grants approval of such generic drugs only if the generic can prove in court
that it can produce its version of the drug without infringing any valid brand-firm patent.
Indeed, FDA regulations lead frequently to scenarios where the outcome of patent litigation
determines whether a brand firm maintains a status quo monopoly or a generic firm is able
to enter.
In the most common scenario, the FDA grants a five-year new chemical entity (NCE)
exclusivity to a pioneer drug. Once this exclusivity expires, other firms may seek to enter.
The Hatch-Waxman Act encourages entry by granting a 180-day marketing exclusivity to
the first generic applicant to both file for and successfully obtain ANDA approval.6 To earn
this exclusivity, a successful entrant must provide in its ANDA to the FDA:7
(A) a certification, in the opinion of the applicant and to the best of his [or her]
knowledge, with respect to each patent which claims the drug for which such
investigations were conducted or which claims a use for such drug for which the
applicant is seeking approval under this subsection and for which information is
required to be filed under Paragraph (i) or subsection (c) of this section,
(i) that such patent information has not been filed;
(ii) that such patent has expired;
(iii) of the date on which such patent will expire, or
(iv) that such patent is invalid or will not be infringed by the manufacture,
use, or sale of the new drug for which the application is submitted;
(B) if with respect to the drug for which investigations described in Paragraph
(i)(A) were conducted information was filed under Paragraph (i) or subsection
(c) of this section for a method of use patent which does not claim a use for
which the applicant is seeking approval under this subsection, a statement that
the method of use patent does not claim such a use.
6Sections 505(j)(5)(B)(iv) and 505(j)(5)(D) of the FDCA regulate the 180-day exclusivity.
7Federal Food, Drug, and Cosmetic Act (21 U.S.C. 355); Section 505; Subsection (j)(2)(A)(vii)(IV).
The four different types of certifications (A)(i)-(iv) are known, respectively, as "Paragraph
(i)-Paragraph (iv)" certifications. Paragraph (i)-(iii) certifications lead to entry, but no
patent litigation. When a firm pursues entry under Paragraph (iv), however, and the brand
firm initiates a patent infringement lawsuit in response to the certification, the FDA may
review, and perhaps tentatively approve, the ANDA, but it does not grant final marketing
approval until the infringement lawsuit is resolved or the respective patents expire.
The FDA Orange Book lists three basic types of patents: active ingredient, formula-
tion and method of use.8 Under Section (B) above, the generic applicant can often satisfy
the FDA's requirement for granting the ANDA, with respect to method-of-use patents, by
specifying that it will not sell the drug for the patented methods. Active ingredients in phar-
maceutical patents are typically claimed by their chemical structure. To receive an ANDA
approval, a generic must essentially copy this chemical structure in its drug. Hence, active-
ingredient patents would nearly always be found infringed in Paragraph (iv) patent litigation.
However, a generic firm may still win a patent lawsuit against an active-ingredient patent
by successfully arguing that it is invalid. For patents covering formulations, by contrast, the
generic may win by proving either invalidity or non-infringement.
After receiving notice that a generic is pursuing an ANDA (iv), a brand firm has 45
days to initiate a lawsuit. If the brand firm sues within this window, the FDA's approval of
the ANDA is stayed until the earliest of the following: (1) the patents expire; (2) the court
decision is issued; (3) the 30-month stay expires (FTC 2002). Note that the 30-month stay
is important because it gives incumbent firms incentives to initiate litigation even in cases
where they have a low probability of winning.9 The FTC reports that the FDA usually
takes over 25 months to approve the ANDA even when no litigation occurs.10 By filing the
first Paragraph (iv) ANDA, a generic firm can delay entry of another generic firm even when
the first one has not succeeded in a litigation case but the second generic has (Korn et al.
8See §314.53 of FDA regulations, and FDA proposed rules at 67 Fed. Reg. 65448-65.
Brand and generic firms sometimes settle their disputes rather than go through Paragraph
(iv) litigation. Through 2000, there were at least nine settlements where the brand made
a payment to the generic, suggesting anticompetitive motives (FTC 2002; Shapiro 2003;
Bulow 2004). Beginning in 2000, the FTC initiated prosecutorial actions against pharma-
ceutical firms over four settlements: Hoechst-Andrx (Cardizem), Abbott-Geneva (Hytrin),
Bristol-Shein (BuSpar) and Schering-Upsher-Smith (K-Dur). The first three of these set-
tlements included maximal entry dates, as well as anticompetitive stipulations that were
clearly outside the scope of the patents: e.g., agreements by generics not to enter with any
product using the brand's active ingredient, and agreements that the generic would not give
up or trigger the 180-day exclusivity (Bulow 2004). Each of these three cases ended in a
consent decree. Schering-Plough and Upsher-Smith, whose settlement over K-Dur did not
include anticompetitive measures outside the scope of the (hypothetically ironclad) patent,
and which negotiated entry dates prior to patent expiry, instead contested the case. The
FTC's actions sharply reduced reverse settlement activity during 2000-04 (FTC 2010).
9Prior to 2003, different ANDA filings for different patents of the same drug caused multiple 30-month
stays when litigated. Furthermore, after 1998, a court decision of dismissal, a certified settlement, or non-infringement/invalidity of patents can trigger approval and the 180-day exclusivity. Prior to 1998 only a successful decision (patents invalid or not infringed) triggered approval (Korn et al. 2009).
10In March 2000, the FDA also issued guidelines for what constitutes a triggering court decision. For cases
where the FDA approves an ANDA (iv) due to the expiration of the 30-month stay, most generic firms wait until the district court decision to begin marketing; if they market before an adverse court decision, then they may be liable for lost profits to the brand firm if they lose the case.
11The decision in Mova Pharm. Corp. vs. Shalala (955 F.Supp. 128 [D.D.C. 1997], aff'd, 140 F.3d 1060
[DC Cir. 1998]), which invalidated the successful defense requirement, established this precedent.
However, despite the FTC's efforts to curtail pay-for-delay settlements, on June 27,
2002, an Administrative Law Judge upheld the Schering-Upsher-Smith settlement. Although
the decision was appealed and reversed by the full Commission, the 11th Circuit Court of
Appeals eventually upheld the settlement [Schering-Plough vs. FTC, 402 F.3d 1056 (11th
Cir. 2005)]. In this and two subsequent cases in other Circuits,12 the Appellate Courts
endorsed a "scope of the patent" test for whether the agreements were anticompetitive.
Under this test, if the agreement is permissible conditional on an ironclad patent, then it
is not anticompetitive. The Supreme Court declined to hear any of these cases. A surge in
pay-for-delay settlements ensued (FTC 2010). Later settlements have typically avoided the
type of aggressive stipulations found in the early agreements, and firms have often obscured
the size of the reverse payment.
In 2012, the 3rd Circuit Court of Appeals rejected the "scope of the patent" test and
found the Schering-Plough settlement anticompetitive in an antitrust case brought by various
purchaser groups.13 This created a Circuit Court split, prompting the Supreme Court to
engage the reverse-settlement question. In a June 2013 decision over a reverse settlement
for the drug Androgel [FTC vs. Actavis, Inc. (133 S. Ct. 2223 [2013])], the Supreme Court
remanded the case to the 11th Circuit Court of Appeals, and instructed courts to
apply a "rule of reason" analysis whenever a settlement includes a "large and otherwise
unexplained" payment from the brand to the generic (Hovenkamp forthcoming). Numerous
cases subject to the Actavis rules remain in litigation.
3. Theoretical Model
As a foundation for our empirical analysis, we introduce the following model of the
Paragraph (iv) litigation process. Consider a market where a risk-neutral brand firm (B)
operates as a monopolist and a risk-neutral generic firm (G) seeks entry. If the brand firm
wishes to deter entry, it initiates litigation. If the brand firm is ultimately successful, the
generic firm cannot enter and the brand firm's monopoly continues. If the brand firm is
unsuccessful, the generic firm can enter and the brand firm's monopoly ends.
12See In re Tamoxifen Citrate Antitrust Litigation (466 F.3d 187 [2d Cir. 2006]) and In re Ciprofloxacin
Hydrochloride Antitrust Litigation (544 F.3d 1323 [Fed Cir. 2008]). See also Elhauge and Krueger (2012, pp. 285-87) for a more complete discussion of the reasoning of the Circuit Courts of Appeal.
13In Re: K-Dur Antitrust Litigation, 686 F.3d 197 (3rd Cir. 2012)
Figure 1: A Model of Paragraph (iv) Patent Litigation
Note: This figure shows the Paragraph (iv) resolution process.
Figure 1 shows a game tree mapping the outcomes of litigation.14 In the pre-litigation
period, at the top of the tree, firms and investors form expectations of future payoffs prior
to any decisions. Then, nature decides whether the brand or generic wins the case at the
district court level. Let α be the probability the brand firm wins. Just after the district
court decision, firms and investors update their expectations of future payoffs. Then, in
subsequent ("appellate") review, nature determines whether the district court decision stands
or is reversed. Let βB be the probability a brand win is upheld and let βG be the probability
that a generic win is upheld.
14We do not model the selection process into litigation. We find little evidence that there is any substantial
selection into litigation based on observable case-specific covariates, particularly those like sales that would be reflective of welfare. In fact, annual sales in our sample of drugs involved in litigation average just over $1 billion, which is slightly larger than the average annual sales reported by Drake et al. (2014) for drugs involved in cases that settled ($751 million). Thus, we expect our results to have some degree of external validity. However, even without this assumption of external validity, our results provide valuable insight for an important and substantial portion of pharmaceutical sales.
To conserve on notation, and since nearly all decisions are appealed, we do not explicitly
model a decision to appeal. Implicitly, βB includes the probability of all scenarios such that
the district court decision is not overturned. This group includes decisions of the generic
not to appeal the decision, as well as cases where the generic does initiate an appeal but the
appellate case is either dismissed, settled, or decided in favor of the brand.
Let the ultimate profit \(\pi_i\) for firm \(i \in \{B, G\}\), net of litigation costs, be the following:
\[
\begin{array}{lll}
\text{Brand Wins (No Entry Occurs):} & \pi_B = V_B^{Win}, & \pi_G = V_G^{Lose} \\
\text{Generic Wins (Entry Occurs):} & \pi_B = V_B^{Lose}, & \pi_G = V_G^{Win}
\end{array}
\]
We assume joint profits are higher when the brand wins and the monopoly is preserved,
\(V_B^{Win} + V_G^{Lose} \geq V_B^{Lose} + V_G^{Win}\). These payoffs are realized only at the conclusion of the
dispute. The dispute value, \(V_i^{Win} - V_i^{Lose}\), gives the stakes in the case for firm i. Denote
\(V_B^{Win} - V_B^{Lose}\) as the value of deterrence for the brand firm and \(V_G^{Win} - V_G^{Lose}\) as the value of
entry for the generic firm.
These values are not directly observed, but can be inferred using the impact of the district
court decision on firm value along with the market's expectations regarding the outcome of
the district court decision and the subsequent appellate process. Denote the expected payoffs
after the district court decision, but before any appeal, as:
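\[
\begin{array}{lll}
\text{Brand Wins District Court Stage:} & E_1\{\pi_B\} = V_B^{*,Win}, & E_1\{\pi_G\} = V_G^{*,Lose} \\
\text{Generic Wins District Court Stage:} & E_1\{\pi_B\} = V_B^{*,Lose}, & E_1\{\pi_G\} = V_G^{*,Win}
\end{array}
\]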
These are shown on the tree just above the appellate-review nodes. From the tree, we see
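that for a brand firm,
\[
V_B^{*,Win} = \beta_B V_B^{Win} + (1 - \beta_B) V_B^{Lose}, \qquad
V_B^{*,Lose} = \beta_G V_B^{Lose} + (1 - \beta_G) V_B^{Win}.
\]
Now consider the expected value of the brand firm at the very top of the tree,
\[
E_0\{\pi_B\} = \alpha V_B^{*,Win} + (1 - \alpha) V_B^{*,Lose}.
\]
Rearranging terms, we can write
\[
\alpha \left( V_B^{*,Win} - E_0\{\pi_B\} \right) + (1 - \alpha) \left( V_B^{*,Lose} - E_0\{\pi_B\} \right) = 0. \tag{1}
\]
Denote \(V_i^{*,Win} - E_0\{\pi_i\}\) (\(V_i^{*,Lose} - E_0\{\pi_i\}\)) as the decision impact of a win (loss), respectively,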
for firm i. Then, the first term in (1) is the decision impact when a brand firm wins a
Paragraph (iv) lawsuit, weighted by the probability of a brand win. Correspondingly, the
second term reflects the decision impact when a brand firm loses the case. Doing a bit of
algebra, we find the following relationship between the decision impact and the dispute value
for i ∈ {B, G}, conditional on the district court decision:
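\[
\begin{aligned}
&\text{Brand wins the district court decision:} \\
&\quad \text{Effect on B: } V_B^{*,Win} - E_0\{\pi_B\} = (1 - \alpha)(\beta_B + \beta_G - 1)\,(V_B^{Win} - V_B^{Lose}) \\
&\quad \text{Effect on G: } V_G^{*,Lose} - E_0\{\pi_G\} = -(1 - \alpha)(\beta_B + \beta_G - 1)\,(V_G^{Win} - V_G^{Lose}) \\
&\text{Generic wins the district court decision:} \\
&\quad \text{Effect on B: } V_B^{*,Lose} - E_0\{\pi_B\} = -\alpha(\beta_B + \beta_G - 1)\,(V_B^{Win} - V_B^{Lose}) \\
&\quad \text{Effect on G: } V_G^{*,Win} - E_0\{\pi_G\} = \alpha(\beta_B + \beta_G - 1)\,(V_G^{Win} - V_G^{Lose})
\end{aligned}
\]
In each line, the final factor \(V_i^{Win} - V_i^{Lose}\) is the deterrence value (for B) or the entry value (for G).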
These equations form the basis of our methodology. For each district court decision in
our sample, we observe two events—one firm wins and one loses. For each event (e.g.,
"Pfizer win") for which the firm is publicly-traded, we complete the following steps. We
first estimate the decision impact (left side of the equation), using an event-study routine
described in subsection 5.1. Then, we estimate the decision probabilities, α, βB, and βG,
using a nearest-neighbor technique described in subsection 5.2. Then, we use the event-study
and the nearest-neighbor results to solve for the dispute value (the value of deterrence or of entry), \(V_i^{Win} - V_i^{Lose}\).
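The last step can be illustrated with a short calculation that inverts the decision-impact relationships for a single event. This is a minimal sketch, not the authors' estimation code: the function names, argument names and numeric inputs are hypothetical, and the decision impact and the probabilities are treated as already estimated.

```python
def _multiplier(alpha, beta_b, beta_g, district_winner, firm):
    """Factor linking a decision impact to the dispute value V_i^Win - V_i^Lose."""
    if district_winner not in ("B", "G") or firm not in ("B", "G"):
        raise ValueError("district_winner and firm must be 'B' or 'G'")
    # Ex ante probability of the district court outcome that did NOT occur.
    surprise = (1.0 - alpha) if district_winner == "B" else alpha
    # Share of a district court win that survives appellate review.
    persistence = beta_b + beta_g - 1.0
    # A firm's value rises when it wins and falls when its rival wins.
    sign = 1.0 if firm == district_winner else -1.0
    return sign * surprise * persistence


def decision_impact(dispute_value, alpha, beta_b, beta_g, district_winner, firm):
    """Change in firm value at the district court decision implied by the model."""
    return _multiplier(alpha, beta_b, beta_g, district_winner, firm) * dispute_value


def implied_dispute_value(impact, alpha, beta_b, beta_g, district_winner, firm):
    """Back out the dispute value from an observed decision impact."""
    m = _multiplier(alpha, beta_b, beta_g, district_winner, firm)
    if m == 0.0:
        raise ValueError("multiplier is zero; the dispute value is not identified")
    return impact / m


# Hypothetical inputs (not estimates from the paper), values in $ billions.
impact = decision_impact(4.0, alpha=0.5, beta_b=0.9, beta_g=0.8,
                         district_winner="B", firm="B")
recovered = implied_dispute_value(impact, 0.5, 0.9, 0.8, "B", "B")
print(round(impact, 3), round(recovered, 3))
```

The multiplier shrinks toward zero as the district court outcome becomes more predictable or as appellate reversal becomes more likely, so a small observed change in firm value can still imply a large dispute value.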
In Figure 1 we explain the litigation process, but we do not model the decision to settle
before litigation. We do not model settlements before litigation because in this industry
brand firms often have incentives to litigate rather than settle. For example, the brand
firm remains a monopolist for the duration of the litigation process; thus, even in
cases where the brand firm expects to lose the case, it may choose to litigate. In light of
this incentive to litigate, our estimates can be interpreted without the need to model the
pre-litigation decision-making process. Furthermore, the duration of the litigation
process allows the parties to reduce asymmetric information through discovery.
As a result, cases that do not settle likely continue to resolution not because
of asymmetric information but because brand firms ascertain their chances of a victory.
The following analysis is not necessary for estimating the dispute values, but it helps
explain the incentives to settle cases and how settlements affect welfare.
Bargaining Surplus
Prior to the start of litigation, let brand and generic firms engage in Nash Bargaining
to settle the case. Conditional on a settlement, the firms maximize the joint surplus by
maintaining the brand monopoly and achieving joint profit \(V_B^{Win} + V_G^{Lose}\). If they settle, the
firms increase total profit by the difference between this joint profit and the expected joint
surplus under litigation. Thus, the bargaining surplus, \(S^{Bargain}\), net of litigation costs, is
\[
S^{Bargain} = \left[ \alpha (1 - \beta_B) + (1 - \alpha) \beta_G \right] \left[ \left( V_B^{Win} - V_B^{Lose} \right) - \left( V_G^{Win} - V_G^{Lose} \right) \right]. \tag{3}
\]
Let the net transaction cost of an efficient bargain, \(C^{Bargain}\), incorporate any litigation costs.
Then, firms settle if and only if \(S^{Bargain} \geq C^{Bargain}\).15 We see that \(S^{Bargain}\) is increasing in
both the probability the brand firm ultimately loses the case [the first bracketed term in (3)],
and the difference of the value of deterrence and the value of entry.16
15If explicitly modeled, litigation costs from Paragraph (iv) litigation would enter negatively on the right-hand
side of this expression, while expected costs from possible antitrust scrutiny of the settlement would enter positively on the right-hand side.
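The settlement condition can be sketched numerically as follows. This is a minimal illustration under the model's assumptions, not the authors' code: the function names and all inputs are hypothetical, and the bargaining cost is simply taken as given.

```python
def prob_brand_ultimately_wins(alpha, beta_b, beta_g):
    """Brand wins and is upheld, or generic wins and is reversed."""
    return alpha * beta_b + (1.0 - alpha) * (1.0 - beta_g)


def bargaining_surplus(alpha, beta_b, beta_g, deterrence_value, entry_value):
    """Expression (3): probability the brand ultimately loses times
    (value of deterrence minus value of entry)."""
    prob_brand_loses = alpha * (1.0 - beta_b) + (1.0 - alpha) * beta_g
    return prob_brand_loses * (deterrence_value - entry_value)


def firms_settle(surplus, bargaining_cost):
    """Firms settle if and only if the surplus covers the net transaction cost."""
    return surplus >= bargaining_cost


# Hypothetical inputs (not estimates from the paper), values in $ billions.
alpha, beta_b, beta_g = 0.5, 0.9, 0.8
surplus = bargaining_surplus(alpha, beta_b, beta_g,
                             deterrence_value=4.0, entry_value=0.2)
print(round(surplus, 3), firms_settle(surplus, bargaining_cost=0.05))
```

The two probability expressions are complements: one minus the probability that the brand ultimately wins equals the first bracketed term in (3), which is why the surplus rises as the brand's patents become weaker.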
Let demand for the drug be downward-sloping and assume competition is such that prices
are lower under generic entry. Then the relative values of consumer surplus, conditional on
firm profits, follow \(CS(V_B^{Lose}, V_G^{Win}) > CS(V_B^{Win}, V_G^{Lose})\). Define welfare \(W(\cdot)\) as follows:
\[
W(V_B, V_G) = CS(V_B, V_G) + V_B + V_G.
\]
Net of transaction and litigation costs, we have \(W(\text{Settlement}) = W(V_B^{Win}, V_G^{Lose})\). Defining
\(\gamma = [\alpha \beta_B + (1 - \alpha)(1 - \beta_G)]\) to be the probability the brand ultimately wins, we can write
\(W(\text{Litigation}) = \gamma W(V_B^{Win}, V_G^{Lose}) + (1 - \gamma) W(V_B^{Lose}, V_G^{Win})\). Doing a bit of algebra, we have
\[
W(\text{Litigation}) - W(\text{Settlement}) = (1 - \gamma) \left\{ \left[ CS(V_B^{Lose}, V_G^{Win}) - CS(V_B^{Win}, V_G^{Lose}) \right] - \left[ \left( V_B^{Win} - V_B^{Lose} \right) - \left( V_G^{Win} - V_G^{Lose} \right) \right] \right\}.
\]
If demand is perfectly inelastic, then W (Litigation) = W (Settlement) because lower prices
under generic entry would merely transfer surplus from the firms to the consumers. If demand
is downward-sloping but not perfectly inelastic, then generic entry would also yield higher
sales volume, so that \(W(\text{Litigation}) > W(\text{Settlement})\) and
\[
CS(V_B^{Lose}, V_G^{Win}) - CS(V_B^{Win}, V_G^{Lose}) > \left( V_B^{Win} - V_B^{Lose} \right) - \left( V_G^{Win} - V_G^{Lose} \right).
\]
An important implication of this is that
\[
S^{Bargain} = (1 - \gamma) \left[ \left( V_B^{Win} - V_B^{Lose} \right) - \left( V_G^{Win} - V_G^{Lose} \right) \right]
\]
is a lower bound for the extra consumer surplus, \((1 - \gamma) \left[ CS(V_B^{Lose}, V_G^{Win}) - CS(V_B^{Win}, V_G^{Lose}) \right]\),
gained by the Paragraph (iv) ANDA process, versus the alternative where generic entry is
blocked until the pivotal patent expires.
16If the firms in our model are risk-averse, then SBargain would also include a positive risk premium
because settlement removes all risk linked to litigation outcomes. We do not have data to test whether more-risk-averse firms settle, but we do discuss the implications of risk aversion for our main results in the conclusion.
Table 1 lists the various sources for our litigation data. We capture all drug patents listed
in annual issues of the Patent and Exclusivity Addendum to the FDA Orange Book from
1985 to 2010, including those that have expired or been delisted.17 This yields 3,219 distinct
patents. On average, a brand drug, which corresponds to a unique New Drug Application
(NDA) number, has five patents listed in the Orange Book over its lifespan. We also record
all drugs and firms connected to these patents.
We match the Orange Book information to filed cases in the Derwent Litalert data.
Federal courts report all patent lawsuits to the US Patent and Trademark Office, and the
Derwent data are captured from these filings. During 1985-2010, Derwent data cover 50-70%
of all filed cases (Bessen et al. 2013). Derwent data do not include drug names or, more
To find decisions, we use our Orange Book and Derwent information to search LexisNexis
for written opinions recorded by the Federal Reporter. Opinions always include decisions,
decision dates, courts, related appellate decisions, and nearly always include correct patent
numbers and firm names. In pharmaceutical cases, they typically include drug names and
information on whether the case pertains to a Paragraph (iv) ANDA filing. Opinions do not
typically include filing dates. We match Derwent filings to LexisNexis opinions so that filing
dates may be matched to other variables.
We supplement this sample of lawsuits with information from a sample of letters from the
FDA to generic firms discussing their Paragraph (iv) ANDAs.18 The sample spans May 05,
1987-July 24, 2009, and includes 373 letters representing 200 brand drugs.19 These letters
17The 1986 OB is not available and the 1984 version did not have the patent and exclusivity addendum.
However, patents showing in immediate subsequent years reflect the patents listed in the years missing.
18These letters are archived in the FDA Biosciences Library in Silver Spring, MD. We thank Lee Hu, who
made scanned .pdf files of these letters, for providing them to us.
19We combine different formulations and dosages under one drug name. This does not change the
interpretation of our results, because a single formulation is often responsible for most of a drug's sales.
record the first generic to file, the listed patents for a particular drug and which ones face
Paragraph (iv) certifications. Also, 198 of the letters include litigation outcomes. In the
letters, we discover 28 additional Paragraph (iv) cases, 5 of which are litigated to a decision.
Where possible, we also use the ANDA letters to classify patents. When information on
a patent's type is unavailable from the ANDA letters, we classify each patent claim as an
active-ingredient claim either if the first noun in the claim is "compound" (or derivatives of
this word) or if the claim simply reproduces a chemical formula. We then classify a patent
as an active-ingredient patent if it has at least one active-ingredient claim.20 We compare
our classification versus the letter-based classifications, and misclassify just three out of 953
patents in the ANDA letters (0.3%).21
Figure 2: Trends in Patents (OB) and Drug Sales (IMS) 1985-2010.
[Figure omitted. Panel (a): Patents in Orange Book, with series "Patents w/ AI Claim" and "Patents w/o AI Claim". Panel (b): Annual Drug Sales.]
Note: The patent trends reflect cumulative patents showing in a given year according to every annual edition
of the OB. Sales are for the top 1000 drugs listed by IMS a given year. All dollar figures are standardized
to 2010 US dollars.
Figure 2(a) presents the total number of patents listed in each edition of the Orange
20This is similar to Hemphill and Sampat (2011, 2013).
21The letters also sometimes include information about Paragraph (iii) certification filings for a subset of
listed patents. Of the 953 patents in these letters, 5% face Paragraph (iii) certifications. Most patents facing Paragraph (iii) (79%) have an active ingredient claim.
Book, as well as the number of those patents that have at least one active-ingredient claim.
The proportion of patents claiming an active ingredient has steadily declined as the total
number of listed patents has increased substantially. Patenting has increased overall, and—
see Figure 2(b)—closely tracks growth in sales from 1985 to 2010. The low of 4.8 patents
per billion dollars of sales occurs in 1995, while the high of 8.7 occurs in 1985.
Figure 3: Trends in Paragraph (iv) Litigations and Decisions from 1985-2010.
[Figure omitted. Recoverable legend entry: "# Cases Litigated, Not Decided".]
Note: These Paragraph (iv) challenges count only the first challenge per drug, where different dosages and
formulations of the same drug-name are treated as one. The trend of "Cases Litigated, Not Decided" is
the gap between the top-most trend (representing all filed cases) and cases decided, and it consists
mostly of cases that settled rather than cases that are still pending. "Year" is the year lawsuits are filed.
Figure 3 shows the number of Paragraph (iv) litigations across time, based on when the
lawsuit is initiated, along with the number of decisions. Note that the number of these
litigations, which represent generic entry attempts, closely tracks the trends in sales and
patenting. Moreover, the widening gap between total litigations and decisions suggests firms
have more frequently settled cases in recent years (Greene and Steadman, 2010).22
For the entire period (1985-2010), 301 generic Paragraph (iv) certifications are challenged
in court by the incumbent brand firm. Of these, 159 are litigated to a decision, all between
1988-2012. These data include only the first Paragraph (iv) challenge per drug. We compare
22The big drop in decisions for the 2010 year is because many of the cases beginning in this year were not
yet resolved during the collection of our data. See also FTC (2010, 2013) for similar trends in settlements.
our data to FTC (2002), which includes a comprehensive list of drug and firm names for 104
Paragraph (iv) ANDAs during 1992-2000. Of these ANDAs, 75 are litigated to a district
court decision. Our data construction misses just one of the 75 cases (we add this case).23
This gives us confidence that our complete data set includes the vast majority
of litigations initiated during 2000-2010 as well.
4.1. Final Sample
Among the 159 Paragraph (iv) decisions, we restrict attention to cases where there was
no generic entry into the market of any drug with the same active ingredient prior to the
district court decision.24 We also drop six Paragraph (iv) cases where the decision did not
pertain to the validity and/or infringement of the patents.25 When there are multiple cases
involving the same active ingredient, or the same patent(s), we use the first case—this drops
nine cases. In applying the event study methodology, we drop seven additional cases.26
Hence, our final sample for empirical estimation includes 93 drug-cases, with the first
decision occurring in 1988. Note that our inclusion criteria are less strict than restricting
attention to just former NCE drugs. Indeed, our sample includes 20% of drugs approved
prior to the establishment of the NCE exclusivity in 1984. We match drugs to sales data
from IMS Health. Finally, we match the firms involved in Paragraph (iv) decisions with
their stock returns from CRSP and company information from COMPUSTAT. We use SDC
Platinum by Thomson Financial Securities Data to track mergers and acquisitions (M&A).27
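The sample restrictions above can be tallied step by step. The following is only an arithmetic check using the counts reported in the text and footnotes; the variable names are illustrative, and it assumes the single at-risk launch (Neurontin) is already counted among the 44 cases with prior generic entry.

```python
# Counts are taken from the text; labels and names are illustrative only.
decided_cases = 159
exclusions = [
    ("generic entry before the district court decision", 44),
    ("decision not about validity or infringement", 6),
    ("later case on the same active ingredient or patent(s)", 9),
    ("dropped when applying the event study", 7),
]

remaining = decided_cases
for reason, dropped in exclusions:
    remaining -= dropped
    print(f"after dropping {dropped:>2} ({reason}): {remaining}")

final_sample = remaining
```

Under these assumptions the tally ends at the 93 drug-cases used for estimation.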
23FTC (2002) also states that there were 26 Paragraph (iv) decisions during 1984-91 but does not record
drug or firm names. Our matching of Orange Book patents to Derwent and LexisNexis records, plus the ANDA letters, captures 16 decisions during this period.
24This eliminates 44 cases. We rely on the FDA Orange Book and its website (Drugs@FDA) to determine
when any generic entry occurs. We also check with Factiva and LexisNexis news sources if a generic firm launches at risk during the litigation proceeding and before the district court decision due to the expiration of the 30-month stay. This eliminates just one drug (Neurontin).
25Four involve issues about patent extensions, one involves a use code associated with the patent, but not
the patent itself. The sixth case (Nolvadex) involves the generic firm (Barr) facing the threat of being shut down by the FDA.
26Two involve the same firm and are so closely timed that the event windows overlap. Five cases do not
have public information for the firms involved at the time of the district court decision.
27SDC covers all corporate transactions from 1962-present. Prior to 1992 it reports cases involving at least
4.2. Descriptive Statistics
Table 2 shows descriptive statistics at the case level. The average drug realized just over
$1 billion in sales the year the lawsuit commenced. Lawsuits involve an average of about
two patents. Moreover, one in every two cases includes an active-ingredient patent. For 61%
of the cases, the generic and the brand are both public firms.
We classify each decision—district or appellate—as a brand win if one or more patents
are found valid and infringed. The brand wins about 57% of the time in the district-court
decision. Among district-court decisions, an appellate decision is also reached about 72% of
the time. Generics win 5 of 36 appeals of brand wins (about 14%), and achieve reversals in
5 of 53 district-court brand wins (about 9%). Brands win 6 of 31 appeals of generic wins
(about 19%), and achieve reversals in 6 of 40 district-court brand losses (about 15%).28
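As a rough illustration, the raw counts above can be turned into sample analogues of α, βB, βG and γ. This sketch reads βB and βG as the probabilities that a district-court brand win and generic win, respectively, are not reversed, which is what the definition γ = αβB + (1 − α)(1 − βG) implies. These are simple sample frequencies, not the case-specific estimates used later in the paper, and the variable names are illustrative.

```python
from fractions import Fraction

# Counts reported in the text (cases not appealed count as affirmed).
n_cases = 93
brand_district_wins = 53
generic_district_wins = 40
brand_wins_reversed = 5      # generics prevail on appeal
generic_wins_reversed = 6    # brands prevail on appeal

alpha = Fraction(brand_district_wins, n_cases)                       # brand wins at district court
beta_B = 1 - Fraction(brand_wins_reversed, brand_district_wins)      # brand win stands
beta_G = 1 - Fraction(generic_wins_reversed, generic_district_wins)  # generic win stands

# Probability the brand ultimately wins.
gamma = alpha * beta_B + (1 - alpha) * (1 - beta_G)

for name, value in [("alpha", alpha), ("beta_B", beta_B), ("beta_G", beta_G), ("gamma", gamma)]:
    print(f"{name} = {float(value):.3f}")
```

With these counts the brand ultimately prevails in 54 of the 93 cases: 48 district-court wins that stand plus 6 reversals of generic wins.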
Cases typically occur late in the life of the patents. The last patent to expire (youngest
patent) typically has just 6.3 years of life after the district-court decision. The first patent
(oldest patent) is about one year older. Also, the district-court decision is reached 5.3 years
after the expiration of the NCE exclusivity, about ten years after the drug's approval.
Table 3 highlights characteristics of brand and generic events. The 93 cases in our sample
yield 82 public-firm brand events and 68 public-firm generic events, where an event is a firm-
decision pair. Brand firms are three times as large as generic firms on average. The total
number of brand firms is 26 (approximately 3.2 litigations per firm) and the number of
generic firms is 18 (approximately 3.8 litigations per firm).
To identify the value of deterrence and entry using an event study, we need the district
court's decision to represent a sudden, exogenous release of information to investors regarding
generic entry. If the stock market aggregates this information efficiently (Fama 1970), then
5% of the ownership of a company where the transaction was valued at $1 million or more. After 1992, deals of any value are reported.
28For reversal calculations, which are pertinent for estimating βB and βG, we count cases not appealed as
maintaining the district court decision.
Figure 4: Mean Return for Brand Firms Around District Court Decision
[Figure omitted. Two panels, (a) and (b); horizontal axes: Weeks Prior to Decision and Weeks After Decision.]
Note: This figure shows the coefficient estimates from a regression of brand firms' returns on dummy
variables for both the days immediately after the district court decision and the weeks prior to and following
the district court decision. These trends of mean returns do not yet account for the full structure of the
event study; rather, they motivate the appropriateness of an event study.
changes in firms' stock prices reflect the decision's impact on the firms' valuations. The
following exercise suggests that these conditions hold in our context.
When a brand firm wins the district court decision, the brand firm's stock price should
increase. Conversely, when the brand firm loses, its stock price should decrease. Figures
4(a) and 4(b), which show the average return and a 95% confidence interval for the 20 weeks
surrounding the decision, confirm this basic intuition. On the day following a brand win,
brand firms' market value increases by an average of 1%. After a loss, brand firms' value
decreases by an average of more than 1.5%. For both types of events, the only statistically
significant variation in returns occurs on the day following the event.
The results for generic returns, highlighted in Figures 5(a)-(b), follow a nearly identical
pattern. On average, on the day following the decision, generic firms' market value increases
by about 2.3% when the generic wins. In contrast, generic firms' value falls by about 1.6%
when the challenge fails.
Figure 5: Mean Return for Generic Firms Around District Court Decision
[Figure omitted. Two panels, (a) and (b); horizontal axes: Weeks Prior to Decision and Weeks After Decision.]
Note: This figure shows the coefficient estimates from a regression of generic firms' returns on dummy
variables for both the days immediately after the district court decision and the weeks prior to and following
the district court decision. These trends of mean returns do not yet account for the full structure of the
event study; rather, they motivate the appropriateness of an event study.
5. Econometric Model
We estimate, for each event, different components of the equations in (2) from our theoretical
model. Then we calculate an estimate of the dispute value for that event. For
example, the decision impact on a brand firm from a favorable district court decision is
$$V_B^{*,Win} - E_0\{\pi_B\} = (1-\alpha)(\beta_B + \beta_G - 1)\,(V_B^{Win} - V_B^{Lose}).$$
We first use our event study to estimate $V_{B,j}^{*,Win} - E_0\{\pi_{B,j}\}$ for each event $j$. We then use
other parts of our data to estimate, for each event, values of the parameters $\hat{\alpha}_j$, $\hat{\beta}_{B,j}$ and
$\hat{\beta}_{G,j}$. Once we have consistent estimates of each component, we can recover an estimate of
the dispute value for the brand firm,
$$\widehat{V_{B,j}^{Win} - V_{B,j}^{Lose}} = \frac{\widehat{V_{B,j}^{*,Win} - E_0\{\pi_{B,j}\}}}{(1-\hat{\alpha}_j)(\hat{\beta}_{B,j} + \hat{\beta}_{G,j} - 1)}.$$
Once we have estimates of dispute values for all events, we can look for temporal variation
to assess the impact of the Schering-Plough decision. Also, we can use (3) to calculate how
bargaining surpluses have changed.
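A small numerical sketch may help fix ideas about how a decision impact is scaled up to a dispute value. It assumes the brand's pre-decision expected payoff is γ times its payoff from winning plus (1 − γ) times its payoff from losing, in which case the impact of a district-court brand win is (βB − γ) times the dispute value, and βB − γ equals the factor (1 − α)(βB + βG − 1) shown above. All numbers, the function names, and the conversion of a CAR into dollars via market capitalization are hypothetical.

```python
def ultimate_brand_win_prob(alpha, beta_b, beta_g):
    """gamma = alpha*beta_B + (1 - alpha)*(1 - beta_G)."""
    return alpha * beta_b + (1 - alpha) * (1 - beta_g)


def impact_scale(alpha, beta_b, beta_g):
    """Share of the dispute value revealed by a district-court brand win."""
    return (1 - alpha) * (beta_b + beta_g - 1)


def dispute_value(decision_impact, alpha, beta_b, beta_g):
    """Scale a dollar decision impact up to the implied dispute value."""
    return decision_impact / impact_scale(alpha, beta_b, beta_g)


# Hypothetical inputs, chosen only for illustration.
alpha, beta_b, beta_g = 0.57, 0.90, 0.85
gamma = ultimate_brand_win_prob(alpha, beta_b, beta_g)

# Check the identity beta_B - gamma == (1 - alpha)(beta_B + beta_G - 1).
identity_gap = (beta_b - gamma) - impact_scale(alpha, beta_b, beta_g)

# Assumed conversion: a 2% abnormal return on a $50 billion market capitalization.
impact_dollars = 0.02 * 50e9
value = dispute_value(impact_dollars, alpha, beta_b, beta_g)
print(f"scale = {impact_scale(alpha, beta_b, beta_g):.4f}, dispute value = ${value / 1e9:.2f} billion")
```

Because the scale factor is well below one whenever the outcome was partly anticipated or may be reversed on appeal, the implied dispute value is a multiple of the observed change in market value.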
5.1. Estimating the Decision Impact: The Event Study Approach
Following Salinger (1992), consider the following model of stock-market returns:
$$\rho_{jt} = \kappa_1 + \kappa_2 \rho_m + \epsilon_{jt},$$
where $\rho_{jt}$ is stock $j$'s return on day $t$, $\rho_m$ is the return on the market index, and $\epsilon_{jt}$ is a
zero-mean error. The CRSP value-weighted market index is included to separate the effect
of common factors driving market returns from the effect of the litigation decision.29
Now, consider a day-T event. The following model permits a regression of "abnormal"
returns on that day:
$$\rho_{jt} = \kappa_1 + \kappa_2 \rho_m + \psi I_{jt} + \epsilon_{jt},$$
where the indicator, $I_{jt}$, equals one when the market reacts to the event on day T and equals
zero otherwise.30 We estimate our model for event j, by ordinary least squares regression.
Following Panattoni (2011), we use a 271-day estimation window, t = [−271, −1].
29We exclude dividends from returns in our analysis, but our results are virtually identical if we include
30Note that the dummy variable approach suggested by Salinger (1992) is equivalent to estimating a
prediction of returns using only information prior to the event (e.g., Returns Procedure). However, the dummy variable approach is computationally easier to program and more robust in estimating standard errors (see Salinger 1992).
We consider a three-day event window, t = [0, 2], to capture the stock market's reaction on the day
of the district-court decision and two days after.31
We repeat this estimation procedure for each event. This yields an estimate, $\hat{\psi}$, of the
change in market value due to the district court outcome for each firm (for example, the brand
firm B in the case of a brand win). We refer to $\hat{\psi}$ as the estimated cumulative abnormal return (CAR).
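The dummy-variable regression described in this subsection can be sketched with ordinary least squares on simulated data. This is a minimal illustration, not the authors' code: the returns are synthetic, the function name is hypothetical, and the event dummy is set to one on each of the three event-window days, so the coefficient is the average abnormal return per event day. How that coefficient is scaled into a cumulative figure is not spelled out in the text, so the sketch reports the raw coefficient.

```python
import numpy as np


def event_study_dummy(stock_ret, market_ret, event_mask):
    """OLS of stock returns on a constant, the market return and an event dummy."""
    X = np.column_stack([
        np.ones_like(market_ret),      # kappa_1
        market_ret,                    # kappa_2
        event_mask.astype(float),      # psi
    ])
    coef, *_ = np.linalg.lstsq(X, stock_ret, rcond=None)
    kappa1, kappa2, psi = coef
    return kappa1, kappa2, psi


# Synthetic data: 271 estimation days followed by a 3-day event window.
rng = np.random.default_rng(0)
n_est, n_evt = 271, 3
T = n_est + n_evt
market = rng.normal(0.0004, 0.01, T)
mask = np.zeros(T, dtype=bool)
mask[-n_evt:] = True

true_psi = 0.01  # assumed abnormal return per event day
stock = 0.0002 + 1.1 * market + true_psi * mask + rng.normal(0.0, 0.002, T)

k1, k2, psi_hat = event_study_dummy(stock, market, mask)
print(f"kappa_2 = {k2:.3f}, psi_hat = {psi_hat:.4f}")
```

Estimating the market-model parameters and the event effect in one regression is what makes the dummy-variable approach convenient: the standard error on the dummy comes directly from the regression output.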
5.2. Estimating Decision Probabilities
If investors have information about the probability the brand will win the case at the
district and appellate court levels, they will incorporate this into their expectations. Hence,
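we must get consistent estimates of α, βB and βG for each case. Because γ = αβB + (1 − α)(1 − βG),
βB and βG are the probabilities that a district-court brand win and generic win, respectively, stand
after the appellate process. In estimating them, we count cases not appealed as maintaining the district court decision.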
We consider three primary variables in the information set of investors: filing year, drug
sales during the filing year, and whether there is an active-ingredient patent. We include
year primarily because of the surge in settlements that followed the Schering-Plough decision.
If cases with a lower probability of brand success tend to settle more often, then investors
are likely to be aware of this and incorporate it into their expectations. We include sales
because firms may commit different levels of resources to research, development, intellectual
property protection and litigation, depending upon how important the drug is. Finally,
active-ingredient patents are virtually always infringed, so brands asserting such a patent
tend to prevail in litigation more frequently (Hemphill and Sampat 2011, 2013). While we
do not have strong priors regarding how affirmation rates of district court decisions may vary
with any of these factors, as we do for α, we permit both βB and βG to vary by the same
three factors during estimation.
To flexibly estimate αj, βBj, and βGj for each event, j, we employ a multidimensional
31We also use two-day and four-day windows and find nearly identical results. In addition, we compare
the dummy variable results to the returns procedure using EVENTUS (available from Wharton Research Data Services), and the results are robust to both approaches.
nearest-neighbor estimator similar to that of Nevo et al. (2013).32 There are two primary
reasons. First, the distribution of events across time and sales, shown in Figure 6, is quite
uneven. The low frequency of events early in the sample, and the highly skewed distribution
of sales, make a bandwidth-adaptive estimator a good choice to estimate these functions.
Second, it is not clear, a priori, how the three predictors of outcomes interact (i.e., higher
sales drugs may experience a different probability of brand victories over time), which makes
the flexibility of a nonparametric approach attractive.
Figure 6: Distribution of Events by Sales and Decision Year.
Note: Annual Sales are reported for the litigation filing year, while Year of Decision corresponds to the
district court decision year.
To demonstrate this approach, consider estimation of αj. To define the nearest neighbors
for a given decision, j, we first define the closeness of this event from every other event in
32Estimates from both probit and logit specifications with interactions between covariates are very similar.
We also estimated these specifications with indicators for the court and firm, and find them to be jointly insignificant.
terms of sales and time, $d_{ij}$, as
$$d_{ij} = \phi(Year_{ij}, Sales_{ij}).$$
The arguments of the standard multivariate normal density, $\phi$, are the differences in two of
the predictors of a brand win, $Year_{ij}$ and $Sales_{ij}$, for events $i$ and $j$. As suggested by Pagan
and Ullah (1999), prior to taking these differences, we normalize both variables using their
respective means and the Cholesky decomposition of the joint variance-covariance matrix.33
If case $j$ involved an active-ingredient patent ($AI_j = 1$), we then estimate $\alpha_j$ as
$$\hat{\alpha}_j = \frac{1}{N} \sum_{i} 1[d_{ij} \ge d_N, AI_i = 1] \; 1[BrandWin_i = 1].$$
The indicator, $1[d_{ij} \ge d_N, AI_i = 1]$, serves to reduce the sample used in estimation to a
fixed number of nearest neighbors for cases involving an active-ingredient patent. That is,
$d_N$ is the value of $d_{ij}$ for the $N$th-nearest case to $j$, or the cutoff value for inclusion in the calculation. We
set N = 15, but find our estimates to be robust to varying N. The estimates are also
robust to the choice of kernel used to define distances. Estimation of $\alpha_j$ for cases without
an active-ingredient patent is identical, except the sample stratification indicator is now
$1[d_{ij} \ge d_N, AI_i = 0]$. We estimate $\beta_{Bj}$ and $\beta_{Gj}$ for each $j$ similarly.
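The nearest-neighbor procedure described above can be sketched as follows. This is a minimal illustration on simulated data rather than the authors' implementation. It whitens year and sales with their means and the Cholesky factor of their covariance matrix, then ranks cases by Euclidean distance in the whitened space, which yields the same ordering as ranking by a standard normal density of the differences. The function name, the simulated inputs, and the choice to include case j among its own neighbors are assumptions.

```python
import numpy as np


def nn_brand_win_prob(year, sales, ai, brand_win, n_neighbors=15):
    """Share of brand wins among the nearest cases with the same AI status."""
    ai = np.asarray(ai)
    brand_win = np.asarray(brand_win, dtype=float)
    X = np.column_stack([year, sales]).astype(float)
    centered = X - X.mean(axis=0)

    # Whiten with the Cholesky factor of the joint covariance matrix.
    chol = np.linalg.cholesky(np.cov(centered, rowvar=False))
    Z = np.linalg.solve(chol, centered.T).T

    est = np.empty(len(Z))
    for j in range(len(Z)):
        same = np.flatnonzero(ai == ai[j])            # stratify on AI status
        dist = np.linalg.norm(Z[same] - Z[j], axis=1)
        nearest = same[np.argsort(dist, kind="stable")[:n_neighbors]]
        est[j] = brand_win[nearest].mean()
    return est


# Simulated cases (hypothetical values, for illustration only).
rng = np.random.default_rng(1)
n = 60
year = rng.integers(1988, 2011, n)
sales = rng.lognormal(6.0, 1.0, n)
ai = rng.integers(0, 2, n)
brand_win = (rng.random(n) < 0.4 + 0.3 * ai).astype(int)

alpha_hat = nn_brand_win_prob(year, sales, ai, brand_win)
print(alpha_hat[:5])
```

Using a fixed number of neighbors rather than a fixed bandwidth lets the effective window widen where cases are sparse, which is the property the text cites as motivation.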
Table 4 reports the main results. The event-study results are in the top section. The
average CARs are 2.08% for brand wins, -2.43% for brand losses, 3.13% for generic wins
and -1.63% for generic losses. All estimates are statistically significant and suggest that firm
value varies by about 4.5 percentage points depending upon whether the firm wins or loses.
33Since we use the distances only to rank observations rather than calculate weighted averages, a bandwidth
normalization leaves our estimates unchanged.
Estimated means and associated standard errors of the decision probabilities are shown
in the middle section of Table 4.34 The average values of α, βB, and βG are similar to averages
that can be constructed from Table 2. The surfaces in Figures 7(a)-(b) illustrate more detail
of the nearest-neighbor estimation for α. Though the surfaces are not strictly monotonic,
there are clear trends whereby the probability of a brand win in the district court case, α,
is higher for both higher-sales drugs and during more recent years. In comparing the (a)
and (b) panels, we see that the presence of an active-ingredient patent raises the overall
probability of a brand win by between 0.20 and 0.35. Analogous surfaces of estimates of βB
and βG (not shown) indicate that none of the three predictors induce substantial variation
in the probability of a given outcome.
Figure 7: Probability Brand Win, Sales and Year.
[Figure omitted. Panel (a): Active-Ingredient Patent. Panel (b): No Active-Ingredient Patent. Axes include year (ticks from 1994 to 2012) and Annual Sales (Millions of $).]
Note: This figure reflects the estimated probabilities of a brand win, α, for each event using information on
whether an active-ingredient patent is involved, sales, and date of the decision.
For each event j, we follow (4) and use CARj and the estimates of αj, βB,j and βG,j
to estimate (for a firm of type $i \in \{B, G\}$) the dispute value, $V_i^{Win} - V_i^{Lose}$. For brand
events, the mean value of deterrence is about $4.6 billion. For generic events, the mean
value of entry is about $236.8 million. Hence, the mean value of entry is about 5.1% of
34Standard errors are calculated using jackknife resampling.
the mean value of deterrence, highlighting the strongly asymmetric stakes in Paragraph (iv)
cases. The distributions of estimated dispute values are right-skewed, as shown by the lower
median values of deterrence and entry ($355.9 million and $79.4 million, respectively).
The value of deterrence is closely linked to the brand's expected flow monopoly profit
minus its expected flow oligopoly profit, in the market for the drug in question and for the
remaining time that the relevant patents are in force. This exclusion value also gives us a
way to estimate the average value of ironclad versions of the patents covering the drugs. For
the 82 brand events used to estimate the value of deterrence, the average number of patents
is 1.87 per event. Hence, the average patent value for these observations is about $2.5 billion.
This is an important measure of perhaps the most-valuable class of patents in the world.
The value of entry is the generic's profit as an oligopolist. It includes the duopoly profit
that the firm would earn during its 180-day exclusivity, plus additional profit after more
generic firms enter. Notably, the value of entry is equivalent to the minimum payment that
a generic firm, certain to win its Paragraph (iv) case, would accept to stay out of the market
until the brand firm's patents expire. Hence, this is an important benchmark in evaluating
the size of observed reverse payments. Our estimated average, $236.8 million, is similar in
size to the total payments in early (1990s) reverse settlements, which typically stipulated
that the generic stay out of the market until patent expiry.35
In pharmaceutical markets, drug sales are the main determinant of flow profits (Berndt,
Kyle and Ling 2003; Reiffen and Ward 2005). Hence, if our model accurately captures
changes to firm profits, the values of deterrence and entry should be positively correlated
with the relevant drug's sales. To aid interpretation of our results, we regress (estimated)
dispute values on recent (pre-litigation) sales of the drug subject to Paragraph (iv) litigation.
Note that we do not use this relationship directly in estimating dispute values, so this exercise
is a useful test of our model and the event-study methodology.36
35Reliable cash payments are known for the settlements over Nolvadex (1993, $66.4 million), BuSpar (1995,
$72.5 million), Zantac (1995, $132.5 million) and Cipro (1997, $398.1 million). See Hemphill (2009, footnote 114). These dollar figures are not adjusted for inflation.
36Sales are used only as one part of calculating expected decision probabilities. These probabilities enter
the estimation routine non-linearly, and adjust the decision impacts of brands and generics in opposite ways
The results are shown for brands (columns (1)-(2)) and generics (columns (3)-(4)) in Table
5. Columns (2) and (4) control for timing relative to the Schering-Plough decision, which is
clearly significant and which we discuss in more detail below.37 Sales explain a significant
amount of the variation in dispute values, and (as seen by comparing the R-squared values)
explain more for brand events than for generic events.
A one-dollar increase in a drug's sales is associated with a $7.19 increase in the value
of deterrence. Hence, if current sales closely reflect the brand firm's profit as a monopolist,
then our model predicts that the value of deterrence is worth just slightly more, on average,
than current profit times remaining patent life. A one-dollar increase in a drug's sales is
associated with a $0.19 increase in the value of entry (columns (2) and (4), respectively).
Hence, the value of entry is worth just about 40 percent of 180 days worth of sales.38 Relative
to a monopoly payoff, this is similar in size to a Cournot duopoly payoff for the period of
the 180-day exclusivity.
Now consider the implications of these results for settlements. Returning to Table 4, we
use Equation (3), along with average dispute values and average decision probabilities, to
estimate an average bargaining surplus of just under $2 billion. This reflects elimination of
all legal uncertainty and full exclusion of generic competition.
Under partial exclusion, where entry is delayed but the generic retains the 180-day ex-
clusivity, we cannot generally calculate how the bargaining surplus changes with the timing
of entry. However, we can estimate an upper bound for the value of retaining the 180-day
exclusivity. Retained exclusivity has value for the generic, because the settlement increases
the probability that the generic will be able to enjoy it (Hemphill 2009). Specifically, this
probability rises by the probability that the brand wins the case. Using Equation (3), aver-
age dispute values, and average decision probabilities, we find that generic firms may gain
to estimate the values of deterrence and entry.
37We also run versions of the model with patent years left, an interaction between sales and years left,
and a dummy for whether there was an active-ingredient patent. None of the coefficient estimates on these variables are significant.
38Forty percent times the fraction of a year that the 180-day exclusivity represents yields 0.197, or 0.4*(180/365).
as much as $132 million from retained exclusivity.39
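A back-of-the-envelope check shows how the reported averages fit together. It assumes the upper bound on retained exclusivity is the mean value of entry multiplied by the probability that the brand ultimately wins, and that the bargaining surplus is (1 − γ) times the gap between the value of deterrence and the value of entry. The implied γ below is inferred from the rounded figures in the text and is not a number reported by the authors.

```python
# Reported averages, in millions of dollars (rounded figures from the text).
value_of_entry = 236.8
value_of_deterrence = 4600.0
retained_exclusivity_bound = 132.0

# Assumed reading: bound = gamma * value of entry, so gamma can be backed out.
implied_gamma = retained_exclusivity_bound / value_of_entry

# Bargaining surplus implied by the same gamma.
implied_surplus = (1 - implied_gamma) * (value_of_deterrence - value_of_entry)

print(f"implied gamma = {implied_gamma:.3f}")
print(f"implied bargaining surplus = ${implied_surplus:,.0f} million")
```

Under this reading the implied surplus lands a little below $2 billion, in line with the average bargaining surplus reported earlier in the section.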
More interesting is the effect of the Schering-Plough decision. Table 6 reports estimates of
average CARs, and average and median values of deterrence and entry, in the periods before
the first Schering-Plough decision (the pre-SP period) and after it (the post-SP period).
Despite the very small number of observations in the pre-SP period, we nonetheless identify
large average CARs for all four categories of events and find three of the estimates to be
statistically significant. The differences in the average CARs for wins and losses are more
than 5.5 percentage points for brands and nearly 11 percentage points for generics. Brands
win at the district court level about 34% of the time, and just 40% of the time overall.
For these events, we estimate an average value of deterrence of nearly $8.8 billion and an
average value of entry of about $532 million. The value of entry is about 6.1% of the value
of deterrence. The estimated SBargain is about $4.9 billion.
During the post-SP period, our estimates suggest that average stakes are far lower in
Paragraph (iv) cases.
Again, three of the four average CAR estimates are statistically
significant, with generic losses (the exception) estimated to have a near-zero effect. The
differences in the average CARs, for wins and losses, are smaller than in the pre-SP period.
Brands win at the district court level, and overall, about 60% of the time, a much higher
probability than in the pre-SP period. This is precisely what would occur if cases with
weaker patents (i.e., lower γ) tend to settle more often than cases with stronger patents.
We estimate an average value of deterrence of about $3.5 billion in the post-SP period,
which is about 60% lower than the estimated value of deterrence in the pre-SP period. We
estimate an average value of entry of $173.5 million, which is about 67% lower than the
estimated value of entry in the pre-SP period. The value of entry is about 4.9% of the value
of deterrence, similar to but lower than the ratio for the pre-SP period. This is consistent
with a more permissive environment for settlements, causing cases with higher stakes to tend
39Because our analysis uses just cases that reached at least one litigation decision, one might be concerned
that this yields cases where the implied value of retained exclusivity is either abnormally low or high.
However, cases that tend to complete litigation have low brand drug sales and a high probability of brand victory, while retained exclusivity is most valuable when drug sales are high and there is a high probability of brand victory.
to settle more often than cases with lower stakes.
We estimate SBargain is about $1.3 billion for the post-SP period, nearly 73% lower than
in the pre-SP period. If we recall that SBargain is a lower bound for the extra consumer
surplus gained by the Paragraph (iv) ANDA process, our results suggest that Paragraph
(iv) cases decided in the post-SP period deliver far less surplus than cases decided in the
pre-SP period. Hence, pay-for-delay settlements lead to a lower (per case) level of allocative
efficiency in the US pharmaceutical industry.
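The pre-SP and post-SP comparison can be reproduced approximately from the rounded figures quoted in this section. The sketch assumes the bargaining surplus equals (1 − γ) times the difference between the value of deterrence and the value of entry, and uses the overall brand win rates of about 40% and 60% as γ. Because the inputs are rounded, the outputs only approximate the reported values; the function name is illustrative.

```python
def bargaining_surplus(gamma, deterrence, entry):
    """(1 - gamma) times the gap between deterrence and entry values."""
    return (1 - gamma) * (deterrence - entry)


# Rounded inputs from the text, in millions of dollars.
pre_sp = bargaining_surplus(gamma=0.40, deterrence=8800.0, entry=532.0)
post_sp = bargaining_surplus(gamma=0.60, deterrence=3500.0, entry=173.5)
relative_drop = 1 - post_sp / pre_sp

print(f"pre-SP surplus  ~ ${pre_sp:,.1f} million")
print(f"post-SP surplus ~ ${post_sp:,.1f} million")
print(f"relative drop   ~ {relative_drop:.1%}")
```

Both the lower stakes and the higher brand win rate in the post-SP period push the surplus down, which is why the decline in surplus exceeds the decline in either dispute value alone.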
We develop a novel framework to shed light on the distribution of surplus in the US
pharmaceutical industry, and illuminate several policy-relevant phenomena. First, we find
that brand firms in Paragraph (iv) ANDA cases value deterring entry by far more than
generic firms value the right to enter. This suggests that firms that settle their disputes
rather than litigate would realize sizable additional surpluses. We estimate the average
bargaining surplus to be just under $2 billion per Paragraph (iv) case.
We also provide evidence that pay-for-delay settlements reduce allocative efficiency. In
Paragraph (iv) litigation decisions after the closely-watched Schering-Plough decision in
2002, estimated bargaining surpluses are far smaller than for cases prior to this decision.
This suggests that high-bargaining-surplus cases select into settlement, reducing the average
allocative-efficiency surpluses delivered by Paragraph (iv) litigation.
We are optimistic that our results will be useful for informing litigation and public policy.
Our estimates of the value of entry, in particular, help frame the "large and otherwise un-
explained payment" inquiry under the Actavis rules. Many authors argue that any payment
in excess of litigation costs should be interpreted as purchasing some delay (e.g., Edlin et al.
2015). We show that the value of retained exclusivity itself may be "large," but it depends
on the probability that the generic would win the Paragraph (iv) case. As a consequence, in a
settlement with retained exclusivity but no other payments, the court would need to inquire
into the strength of the patents under hypothetical litigation.
If firms are risk-averse, then our estimates of SBargain understate the true size of bargain-
ing surpluses. If firms are strongly risk-averse, then these estimates could be higher than the
changes in consumer surplus achieved via the Paragraph (iv) process. Unfortunately, we do
not have data to study this further. Given the emphasis some commenters have placed on
risk aversion as a motivation for pay-for-delay settlement (Willig and Bigelow 2004; Harris
et al. 2014), this represents an important avenue for future research.
Despite our findings, it is clear that the Hatch-Waxman Act has achieved considerable
allocative efficiency gains by stimulating generic entry. IMS Health data show that the
generic dispensing ratio (percentage of generic to total prescriptions) reached 50% in 1999
and 84% in 2012, compared to 18.6% in 1984 (Levy 1999; IMS 2013). GPhA (2013) estimates
savings from generic prescribing in 2012 alone to be over $217 billion.
Finally, we cannot say much about dynamic efficiency. If the increased rents earned by
firms due to pay-for-delay settlements lead to a surge in new drugs with significant impact
on quality of life, then such settlements could enhance overall efficiency. However, given the
time required to develop new drugs, it is too early to expect such a surge to materialize.
This is a fruitful area for future research and we look forward to further progress.
References
Berndt, E.; Bhattacharjya, A.; Mortimer, R.; Parece, A.; Tuttle, E. 2007. "Authorized
Generic Drugs, Price Competition, and Consumers' Welfare," Health Affairs 26(3),
Berndt, E.; Kyle, M.; Ling, D. 2003. "The Long Shadow of Patent Expiration: Generic Entry
and Rx-to-OTC Switches," in R.C. Feenstra and M.D. Shapiro, eds., Scanner Data
and Price Indices, University of Chicago Press: Chicago, IL.
Berry, S. 1992. "Estimation of a Model of Entry in the Airline Industry," Econometrica
60(4), 889-917.
Bessen, J. 2009. "Estimates of Patent Rents from Firm Market Value," Research Policy, 38,
Bessen, J.; Neuhäusler, P.; Turner, J.; Williams, J. 2013. "The Private Costs and Benefits
of United States Patents: 1984-2009," Working Paper.
Bradford, D.; Turner, J.; Williams, J. 2015. "Off-Label Use of Pharmaceuticals: A Detection
Controlled Estimation," Working Paper.
Branstetter, L.; Chatterjee, C.; Higgins, M. 2011. "Regulation and Welfare: Evidence from
Paragraph IV Generic Entry in the Pharmaceutical Industry," NBER Working Paper
Bresnahan, T.; Reiss, P. 1990. "Entry in Monopoly Markets," Review of Economic Studies
57(4), 531-553.
Bresnahan, T.; Reiss, P. 1991. "Empirical Models of Discrete Games," Journal of
Econometrics 48(1), 57-81.
Bulow, J. 2004. "The Gaming of Pharmaceutical Patents," in Adam Jaffe, Josh Lerner and
Scott Stern, eds., Innovation Policy and the Economy MIT Press: Cambridge, MA.
Ciliberto, F.; Tamer, E. 2009. "Market Structure and Multiple Equilibria in Airline Markets,"
Econometrica 77(6), 1791-1828.
Drake, K.; Starr, M.; McGuire, T. 2014. "Do 'Reverse Payment' Settlements of Brand-Generic
Patent Disputes in the Pharmaceutical Industry Constitute an Anticompetitive
Pay for Delay?," NBER Working Paper #20292. Accessed August 2014.
Edlin, A.; Hemphill, S.; Hovenkamp, H.; Shapiro, C. 2013. "Activating Actavis," Antitrust
Elhauge, E.; Krueger, A. 2012. "Solving the Patent Settlement Puzzle," Texas Law Review
91, 283-300.
Ericson, R.; Pakes, A. 1994. "Markov-Perfect Industry Dynamics: A Framework for
Empirical Work," Review of Economic Studies 62(1), 53-82.
Fama, E. 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work,"
Journal of Finance 25:2, 383-417.
Federal Trade Commission, 2002. "Generic Drug Entry Prior to Patent Expiration: An FTC
Study," US Government Printing Office, Washington, DC, July 2002.
Federal Trade Commission, 2010. "Pay-for-Delay: How Drug Company Pay-Offs Cost
Consumers Billions," US Government Printing Office, Washington, DC, January 2010.
Federal Trade Commission, 2011. "Authorized Generic Drugs: Short-Term Effects and Long-
Term Impact," US Government Printing Office, Washington, DC, August 2011.
Federal Trade Commission, 2013. "Pay-for-Delay Deals: Limiting Competition and Costing
Consumers," US Government Printing Office, Washington, DC, July 2013.
Gedge, C.; Roberts, J.; and Sweeting, A. 2013. "An Empirical Model of Dynamic Limit
Pricing: The Airline Industry," Duke Working Paper.
www.gphaonline.org/media/cms/2013 Savings Study 12.19.2013 FINAL.pdf.
Harhoff, D.; Scherer, F.; Vopel, K. 2003. "Exploring the Tail of Patented Invention Value
Distributions," in Ove Grandstrand, ed., Economics, Law and Intellectual Property,
The Hague, Netherlands: Kluwer Academic Publishers, 279-309.
Harris, B.; Murphy, K.; Willig, R.; Wright, M. 2014. "Activating Actavis: A More Complete
Story," Antitrust Magazine 28, 83-89.
Hemphill, S. 2006. "Paying for Delay: Pharmaceutical Patent Settlement as a Regulatory
Design Problem," New York University Law Review 81: 1553-1623.
Hemphill, S. 2009. "An Aggregate Approach to Antitrust: Using New Data and Rulemaking
to Preserve Drug Competition," Columbia Law Review 109(4): 629-687.
Hemphill, S.; Sampat, B. 2011. "When Do Generics Challenge Drug Patents?" Journal of
Empirical Legal Studies 8, 613-649.
Hemphill, S.; Sampat, B. 2013. "Drug Patents at the Supreme Court," Science 339, 1386-87.
Hovenkamp, H.; Janis, M.; Lemley, M. 2003. "Anticompetitive Settlement of Intellectual
Property Disputes," Minnesota Law Review 87, 1719-66.
Hovenkamp, H. Forthcoming. "Anticompetitive Patent Settlements and the Supreme Court's
Actavis Decision," Minn Journal of Law, Science & Technology forthcoming.
IMS 2013. "Declining Medicine Use and Costs: For Better or Worse?," January 2014.
Korn, D.; Lietzan, E.; Shaw, S. 2009. "A New History and Discussion of 180-Day
Exclusivity," Food and Drug Law Journal 64, 335.
Levy, R. 1999. "The Pharmaceutical Industry: A Discussion of Competitive and Antitrust
Issues in an Environment of Change," DIANE Publishing, 1999.
McGuire, T.; Drake, K.; Elhauge, E.; Hartman, R.; Starr, M. "Resolving Reverse-Payment
Settlements with the Smoking Gun of Stock-Price Movements," Iowa Law Review,
Nevo, A.; Turner, J.; Williams, J. 2013. "Usage-Based Pricing and Demand for Residential
Broadband," UGA Working Paper, September 2013.
Pagan, A.; Ullah, A. 1999. Nonparametric Econometrics. Cambridge University Press.
Pakes, A. 1986. "Patents as Options: Some Estimates of the Value of Holding European
Patent Stocks," Econometrica 54, 755-784.
Pakes, A.; Schankerman, M. 1984. "The Rate of Obsolescence of Patents, Research Gestation
Lags, and the Private Rate of Return to Research Resources," in Griliches, Z. ed.,
R&D, Patents and Productivity. University of Chicago Press for NBER: Chicago, IL.
Panattoni, L. 2011. "The Effect of Paragraph IV Decisions and Generic Entry before Patent
Expiration on Brand Pharmaceutical Firms," Journal of Health Economics 30, 126-45.
Reiffen, D.; Ward, M. 2005. "Generic Drug Industry Dynamics," The Review of Economics
and Statistics 87, 37-49.
Salinger, M. 1992. "Standard Errors in Event Studies," The Journal of Financial and
Quantitative Analysis 27, 39-53.
Shapiro, C. 2003. "Antitrust Limits to Patent Settlements," RAND Journal of Economics
34, 391-411.
Snider, C.; Williams, J. 2013. "Barriers to Entry in the Airline Industry:
Dimensional Regression-Discontinuity Analysis of AIR-21," Forthcoming Review of
Economics & Statistics.
Willig, R.; Bigelow, J. 2004. "Antitrust Policy toward Agreements that Settle Patent
Litigation," The Antitrust Bulletin 49, 655-98.
Yu, X.; Chatterji, A. 2011. "Why Brand Pharmaceutical Companies Choose to Pay Generics
in Settling Patent Disputes: A Systematic Evaluation of the Asymmetric Risks in
Litigation," Northwestern Journal of Technology and Intellectual Property 10, 19-36.
Table 1: Pharmaceutical Patent Litigation Data Sources
Key Characteristics
Comprehensive list of patents for FDA-approved
Covers 50-70% of all US patent lawsuits (most
years), includes filing dates, settled cases.
Complete opinions include decisions, decision
dates, firms, Paragraph (iv) info, patent numbers.
Additional Sources
Comprehensive list of ANDAs, including
non Paragraph (iv) cases.
Comprehensive list of Paragraph (iv) cases 1992-2000, includes drug and firm names.
P-IV ANDA Approvals
Sample of letters to generic firms regarding
successful Paragraph (iv) ANDAs, includes first filer, patent type and p-III certification.
Note: This table includes all sources for data used in this paper. When possible, we cross check all sources
and identify the earliest Paragraph (iv) filing per drug to identify the appropriate generic firm.
Table 2: Descriptive Statistics at Drug-Observation Level, Paragraph (iv) Cases Main Sample, 1988-2012
Lawsuits Litigated to a Decision
District Decision Reversed
District Decision Reversed
Additional Statistics
Drug Sales (millions)
Number of Patents
At Least One Active-Ingredient Patent
Drug Had NCE Status Prior to Litigation
Time Relative to district court Decision
Youngest Patent-Life Left (years)
Oldest Patent-Life Left (years)
Since NCE Expired (years)
Note: These statistics reflect a set of Paragraph (iv) litigations constructed from a variety of sources (see
Table 1), as well as patent statistics from USPTO data and drug sales statistics from IMS data. Out of
the total of 159 Paragraph (iv) lawsuits, 93 reach a decision and survive the selection criteria we apply in
constructing our main sample. All "Additional Statistics" are for this main sample of decided cases, except
for the Since NCE Expired statistics, which are restricted to NCE drugs (76). The Drug Sales statistics are
based upon the year the litigation begins, while the Blockbuster statistics are based upon whether the drug
ever achieved top-25 sales.
Table 3: Descriptive Statistics, Paragraph (iv) Litigation Events Main Sample, Public Firms in Cases Litigated to a Decision (1988-2012)
Brand Firm Events
Drug Sales suit yr (millions)
Firm Employees (thousands)*
Firm Revenue (billions)*
Number of Patents
At Least One Active-Ingredient Patent
Affirmed if Appealed
Number of Unique Firms
Generic Firm Events
Drug Sales suit yr (millions)
Firm Employees (thousands)*
Firm Revenue (billions)*
Number of Patents
At Least One Active-Ingredient Patent
Affirmed if Appealed
Number of Unique Firms
Note: These statistics reflect a set of Paragraph (iv) litigations constructed from a variety of sources (see
Table 1), as well as patent statistics from USPTO data, drug sales statistics from IMS data, and firm
employment and revenue from COMPUSTAT. All statistics reflect the full set of events, except for those
marked with a star (*); we lack information for 2 of the brand observations and 5 of the generic observations.
The firm employment and revenue statistics are based upon the year of the district court decision. The Drug
Sales statistics are based upon the year the litigation begins, while the Blockbuster statistics are based upon
whether the drug ever achieved top-25 sales.
Table 4: Estimation Results
Brand Firms (i=B)
Generic Firms (i=G)
Mean CAR (Brand Wins)
Mean CAR (Brand Losses)
Decision Probability Estimation
Mean Dispute Value ($V^{Win} - V^{Lose}$)
Median Dispute Value ($V^{Win} - V^{Lose}$)
Mean Bargaining Surplus
Note: This table shows the results of an event study estimating equation (5) for the main sample, and of
estimates of decision probabilities using (7) and analogous formulas for $\beta_B$ and $\beta_G$. All values in parentheses
are standard errors. Numbers of observations used in the event study: brand wins N=45; brand losses N=37;
generic wins N=28; generic losses N=40. For the average CAR estimates, we report results from a two-sided
test of the null hypothesis that the average CAR is zero. Standard errors for the average CAR estimates
are calculated assuming independence among events. Asterisks denote significance levels: 1%(***); 5%(**);
10%(*). The total N for the decision probability estimates is smaller than the total number of events because
the probability estimates are constructed at the case level. Standard errors for the decision probabilities
are calculated using jackknife resampling. The estimate of the mean bargaining surplus applies formula (3),
$S^{Bargain} = [\alpha(1 - \beta_B) + (1 - \alpha)\beta_G]\,[(V_B^{Win} - V_B^{Lose}) - (V_G^{Win} - V_G^{Lose})]$, using averages reported in this
table.
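The inference steps described in the note to Table 4 can be illustrated with a minimal sketch. It computes a mean CAR with a standard error that assumes independent events, and a leave-one-out jackknife standard error for a case-level probability. The function names and all input numbers are hypothetical and are not taken from the paper's data; the paper's own decision-probability estimator in (7) is not reproduced here, and a simple sample proportion stands in for it.

```python
import math
from statistics import mean, stdev

def mean_car_test(cars):
    """Return (mean, standard error, t-statistic) for H0: mean CAR = 0,
    treating the events as independent draws."""
    n = len(cars)
    avg = mean(cars)
    se = stdev(cars) / math.sqrt(n)
    return avg, se, avg / se

def jackknife_se(values, estimator):
    """Leave-one-out jackknife standard error of estimator(values)."""
    n = len(values)
    # Re-estimate n times, dropping one observation each time.
    loo = [estimator(values[:i] + values[i + 1:]) for i in range(n)]
    loo_mean = mean(loo)
    return math.sqrt((n - 1) / n * sum((x - loo_mean) ** 2 for x in loo))

# Hypothetical inputs, NOT taken from the paper.
example_cars = [0.021, -0.004, 0.035, 0.012, 0.018, -0.009, 0.027]
example_outcomes = [1, 0, 1, 1, 0, 1, 0, 1]  # 1 = brand wins at district court

avg, se, t = mean_car_test(example_cars)
p_hat = mean(example_outcomes)                # stand-in for a decision probability
p_se = jackknife_se(example_outcomes, mean)   # jackknife standard error of p_hat
```

For the sample mean, the jackknife standard error coincides with the usual standard error, which gives a quick check on the implementation. For a nonlinear estimator the two would generally differ.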
Table 5: Dispute Values Versus Brand Sales
Note: The results in column (1) reflect linear regressions of the form $V^{Win} - V^{Lose} = C + \beta_1 \cdot Sales + \beta_2 \cdot \text{post-Schering-Plough} + \epsilon$. Sales is for the year the lawsuit commenced, and post-Schering-Plough
is a dummy variable that takes the value of 1 if the decision occurs after the 2002 Schering-Plough decision.
All calculations are performed in STATA. Standard errors are unadjusted. The following denote statistical
significance: *** 1% level, ** 5% level, * 10% level.
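As a rough illustration of the regression described in the note to Table 5, the sketch below fits dispute values on a constant, brand sales, and a post-Schering-Plough dummy by ordinary least squares. The paper's calculations were performed in STATA; this Python version is only a stand-in. The function `ols` and all data are hypothetical, with dispute values generated without noise from assumed coefficients so that the fit can be checked by eye.

```python
def ols(X, y):
    """Solve the normal equations (X'X) b = X'y by Gaussian elimination."""
    k = len(X[0])
    # Build the augmented matrix [X'X | X'y].
    A = [[sum(row[i] * row[j] for row in X) for j in range(k)]
         + [sum(row[i] * yi for row, yi in zip(X, y))]
         for i in range(k)]
    for col in range(k):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(col + 1, k):
            f = A[r][col] / A[col][col]
            for c in range(col, k + 1):
                A[r][c] -= f * A[col][c]
    # Back-substitution on the upper-triangular system.
    b = [0.0] * k
    for i in reversed(range(k)):
        b[i] = (A[i][k] - sum(A[i][j] * b[j] for j in range(i + 1, k))) / A[i][i]
    return b

# Hypothetical data, NOT from the paper: sales and dispute values in $ billions.
sales = [0.2, 0.45, 0.8, 1.2, 0.3, 0.95]
post = [0, 0, 1, 1, 1, 0]  # 1 if the decision comes after Schering-Plough (2002)
# Assumed coefficients for illustration: C = 0.1, beta1 = 2, beta2 = -0.5.
dispute_value = [0.1 + 2 * s - 0.5 * p for s, p in zip(sales, post)]

X = [[1.0, s, float(p)] for s, p in zip(sales, post)]
C, beta1, beta2 = ols(X, dispute_value)
```

In this sketch a negative `beta2` would correspond to smaller dispute values in cases decided after Schering-Plough, holding sales fixed. Standard errors are omitted here.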
Table 6: Estimation Results: Pre- and Post-Schering-Plough
Brand Firms (i=B)
Generic Firms (i=G)
Pre-Schering-Plough vs. FTC (2002)
Mean CAR (Brand Wins)
Mean CAR (Brand Losses)
Decision Probability Estimation
Mean Dispute Value ($V^{Win} - V^{Lose}$)
Median Dispute Value ($V^{Win} - V^{Lose}$)
Mean Bargaining Surplus
Post-Schering-Plough vs. FTC (2002)
Mean CAR (Brand Wins)
Mean CAR (Brand Losses)
Decision Probability Estimation
Mean Dispute Value ($V^{Win} - V^{Lose}$)
Median Dispute Value ($V^{Win} - V^{Lose}$)
Mean Bargaining Surplus
Note: Estimation and statistical inference in this Table use the same techniques as in the construction of
Table 4. Numbers of observations used in the pre-SP event study: brand wins N=7; brand losses N=10;
generic wins N=6; generic losses N=6. Numbers of observations used in the post-SP event study: brand
wins N=38; brand losses N=27; generic wins N=22; generic losses N=34.
Source: http://www.economics.illinois.edu/seminars/documents/Turner.Pdf