Section 29.6: Negotiation | Scaling Out AI

"We exchanged seventeen offers, discovered we wanted different things all along, and walked away each convinced we had won. That is the good outcome."
A Buyer Agent With a Reservation Value

Big Picture

Negotiation is how agents that cannot simply be ordered around reach an agreement anyway: a structured exchange of proposals, governed by a protocol, ending in a deal that each side prefers to walking away. Coordination (the previous section) assumes the agents share a goal and only need to align. Negotiation drops that assumption. Here the agents have conflicting preferences, each pursues its own payoff, and yet a joint decision must still be made: a price, a task split, a schedule. The game theory of Chapter 28 tells us which agreements are stable and which are efficient; this section is where multi-agent systems turn that theory into a running process. We will see that bargaining has a sharp equilibrium prediction, that negotiating over several issues at once can manufacture value out of nothing more than differing priorities, and that the newest negotiators in the field speak English.

A multi-agent system rarely enjoys the luxury of a single objective. A logistics platform has shippers who want low prices and carriers who want high ones. A cloud scheduler has tenants competing for the same accelerators. A team of warehouse robots must divide a pick list, and each one would rather take the short routes. In every case the agents are self-interested, their preferences conflict, and no central authority dictates the outcome. What remains is negotiation: a process by which agents with conflicting preferences exchange proposals to reach a mutually acceptable agreement. This section dissects that process into its three moving parts, derives the classic equilibrium for the two-party case, shows how multiple issues open the door to win-win trades, and ends at the research frontier where large language models negotiate in natural language.

1. The Anatomy of a Negotiation Beginner

Every negotiation, from a haggle in a bazaar to an automated procurement auction, decomposes into the same three components, and keeping them separate is the first discipline of the field. The negotiation protocol is the set of rules: who may make an offer, in what order, when, and what counts as acceptance or breakdown. The negotiation object is what is being decided: a single number like a price, or a vector of issues like price, delivery date, and warranty length together. The agents' strategies are the private decision rules each agent uses: how much to concede each round, what reservation value (the worst deal it will still accept) to hold, and how a looming deadline changes its behavior. The protocol is public and shared; the object is shared; the strategies are private and adversarial. Almost every result in negotiation theory is a statement about how a fixed protocol and object constrain the rational strategies.

The single most studied protocol is alternating offers: the two parties take turns proposing a complete agreement, and on each turn the receiver either accepts (ending the negotiation) or rejects and makes a counter-proposal. It is the backbone of bilateral bargaining, and it has the appealing property that it never lets either side stall indefinitely without cost, provided we make delay expensive. That cost is usually modeled as discounting: a deal worth $v$ today is worth only $\delta v$ next round, with a discount factor $\delta \in (0,1)$ that encodes how much the agent dislikes waiting. Discounting is what turns a protocol with no natural endpoint into one with a sharp prediction, as the next section shows.

Key Insight: Protocol, Object, Strategy Are Independent Design Choices

You can change the protocol (alternating offers, sealed bids, an auction) without changing what is being negotiated, and you can change the object (one issue or many) without changing who gets to speak when. The agents' strategies then adapt to whatever protocol and object you impose. This separation is exactly why mechanism design (Section 28.6) is powerful: the designer chooses the protocol so that the rational strategies it induces produce an outcome the designer wants. Negotiation is mechanism design seen from the participants' side of the table.

2. Bilateral Bargaining and the Rubinstein Equilibrium Intermediate

Consider the cleanest case: two agents must split a surplus of size $1$ (think of it as the gap between the highest price a buyer will pay and the lowest a seller will accept, normalized). They alternate offers, the buyer proposing first, and each round of delay shrinks the pie through the discount factors $\delta_b$ and $\delta_s$. This is Rubinstein's bargaining game, and its remarkable feature is that it has a unique subgame-perfect equilibrium (the refinement of Nash equilibrium developed in Section 28.3): a single, exact prediction of who gets what. The proposer's equilibrium share is

$$x^{*}_{\text{proposer}} = \frac{1 - \delta_{\text{responder}}}{1 - \delta_{\text{proposer}} \, \delta_{\text{responder}}}, \qquad x^{*}_{\text{responder}} = 1 - x^{*}_{\text{proposer}}.$$

The logic is backward induction. The responder will reject any offer that gives it less than what it could secure by becoming the proposer next round, so the proposer offers exactly that continuation value (no more, no less) and the responder accepts immediately. Agreement is reached in the very first round, with no costly delay, yet the threat of delay is precisely what pins down the shares. Two things fall straight out of the formula. First, patience is power: the agent with the higher $\delta$ (the one who minds waiting less) captures the larger share. Second, there is a first-mover advantage, which shrinks as the agents become equally patient. When $\delta_b = \delta_s = \delta$, the proposer's share is $1/(1+\delta)$, sliding toward an even split as $\delta \to 1$.

Figure 29.6.1 shows the alternating-offers exchange together with the bargaining zone: the band of agreements both sides prefer to no deal, bounded by the buyer's and seller's reservation values, and the point where the equilibrium lands inside it.

Figure 29.6.1: Left: the alternating-offers protocol, with the buyer and seller trading proposals that converge toward the equilibrium share. In the subgame-perfect equilibrium the deal closes in round 1; the intermediate offers are drawn to show the intuition of mutual concession. Right: the bargaining zone, the interval between the seller's reservation value (its lowest acceptable price) and the buyer's reservation value (its highest acceptable price). An agreement exists only when this interval is non-empty, and the equilibrium selects one point inside it.

The code below plays out this game for concrete discount factors. It computes the equilibrium shares from the formula, then simulates rational play to confirm the deal closes immediately, and reports which agent's patience won it the larger slice.

3. Multi-Issue Negotiation: Manufacturing Win-Win Intermediate

Splitting a single number is a zero-sum fight: every cent the buyer gains, the seller loses. This is distributive bargaining, and it is the whole story only when there is one issue. The moment there are several issues and the agents weigh them differently, a richer possibility opens up: integrative bargaining, in which the agents trade concessions on issues they care little about for gains on issues they care about a lot. The trick has a name from labor negotiations, logrolling: I give you the issue you value most, you give me the issue I value most, and we both come out ahead of any even split.

Concretely, suppose two agents negotiate three issues, and each issue has $10$ units to divide. Agent A weights the issues $(1, 6, 3)$ and agent B weights them $(6, 1, 3)$: A cares most about the second issue, B about the first, and they agree on the third. An even five-five split on every issue gives each agent a utility of $50$. But if A concedes the first issue entirely to B and B concedes the second entirely to A, while they split the third, each agent's utility jumps to $75$. No units were created; the agents simply moved each issue toward whoever valued it more. The set of agreements that cannot be improved for one agent without hurting the other is the Pareto frontier (introduced in Section 28.5), and integrative bargaining is the art of climbing onto it. The even split sits strictly inside the frontier, leaving joint value on the table that differing priorities can recover.

Thesis Thread: Negotiation Is Distributed Decision-Making Without a Coordinator

This book's spine is distributing the essential activities of an intelligent system across many machines. Negotiation distributes the decision itself. No node holds all the preferences, no node dictates the outcome, and the agreement emerges from message-passing alone (offers and counter-offers are the only traffic on the wire). That is the same shape as the all-reduce of Section 1.1: local computation, a structured exchange, a coherent global result. The difference is that here the local objectives conflict, so the protocol must do the work that a shared loss function does in training. When we reach coalition formation (Section 29.7) and task allocation (Section 29.8), the same protocol-as-coordinator principle scales from two agents to many.

4. A Runnable Bargaining Demo Intermediate

The program in Code 29.6.1 implements both halves of the story in pure Python: the single-issue Rubinstein game with discounting, and the three-issue integrative negotiation that finds a Pareto-improving trade. It depends on nothing outside the standard library, so the equilibrium and the win-win deal are computed from first principles rather than asserted.

from itertools import product

# ---------- Part 1: single-issue alternating-offers bargaining (Rubinstein) ----------
# Two agents split a surplus of size 1. Each round of delay discounts the pie by
# the agent's discount factor delta. The subgame-perfect-equilibrium share for the
# proposer is x* = (1 - delta_responder) / (1 - delta_proposer * delta_responder).
def rubinstein_shares(delta_p, delta_r):
    x_p = (1 - delta_r) / (1 - delta_p * delta_r)      # proposer's equilibrium share
    return x_p, 1 - x_p

def simulate_alternating(delta_b, delta_s, max_rounds=50):
    # Buyer proposes first. Each proposer offers the responder exactly its discounted
    # continuation value, so a rational responder accepts at once.
    xb_star, _ = rubinstein_shares(delta_b, delta_s)   # buyer share when buyer proposes
    xs_star, _ = rubinstein_shares(delta_s, delta_b)   # seller share when seller proposes
    history = []
    for r in range(1, max_rounds + 1):
        if r % 2 == 1:                                 # buyer's turn to propose
            offer_to_seller, resp_cont = 1 - xb_star, delta_s * xs_star
            accept = offer_to_seller + 1e-12 >= resp_cont
            history.append((r, "buyer", xb_star, 1 - xb_star, accept))
        else:                                          # seller's turn to propose
            offer_to_buyer, resp_cont = 1 - xs_star, delta_b * xb_star
            accept = offer_to_buyer + 1e-12 >= resp_cont
            history.append((r, "seller", 1 - xs_star, xs_star, accept))
        if accept:
            break
    return history

print("=== Single-issue alternating-offers bargaining (Rubinstein) ===")
db, ds = 0.90, 0.80
xb, _ = rubinstein_shares(db, ds)
print(f"discount factors: buyer delta_b={db}, seller delta_s={ds}")
print(f"SPE shares (buyer proposes first): buyer={xb:.4f}, seller={1-xb:.4f}")
print("round  proposer  buyer_share  seller_share  agreement")
for r, who, bs, ss, acc in simulate_alternating(db, ds):
    print(f"  {r:>2}   {who:>6}      {bs:.4f}       {ss:.4f}       {'YES' if acc else 'no'}")

# ---------- Part 2: multi-issue negotiation, finding a Pareto-improving trade ----------
issues = ["budget", "deadline", "scope"]
wA = {"budget": 1, "deadline": 6, "scope": 3}          # A cares most about the deadline
wB = {"budget": 6, "deadline": 1, "scope": 3}          # B cares most about the budget
UNITS, resA, resB = 10, 60, 60                         # units per issue; walk-away utilities

def utility(weights, alloc_to_self):
    return sum(weights[i] * alloc_to_self[i] for i in issues)

even = {i: 5 for i in issues}                          # naive distributive 50/50 split
uA_even = utility(wA, even)
uB_even = utility(wB, {i: UNITS - even[i] for i in issues})
print("\n=== Multi-issue negotiation: distributive vs integrative ===")
print(f"weights A: {wA}")
print(f"weights B: {wB}")
print(f"even 5/5 split  -> u_A={uA_even}, u_B={uB_even}  (sum={uA_even+uB_even})")

# Logrolling: give each issue to whoever values it more (ties split evenly).
logroll = {i: (UNITS if wA[i] > wB[i] else 0 if wA[i] < wB[i] else 5) for i in issues}
allocB = {i: UNITS - logroll[i] for i in issues}
uA_log, uB_log = utility(wA, logroll), utility(wB, allocB)
print(f"logrolling deal -> A gets {logroll}")
print(f"                   u_A={uA_log}, u_B={uB_log}  (sum={uA_log+uB_log})")
print(f"both above reservation {resA}/{resB}: A {uA_log>=resA}, B {uB_log>=resB}")
print(f"joint surplus created by exploiting different priorities: "
      f"+{(uA_log+uB_log)-(uA_even+uB_even)}")

Code 29.6.1: Alternating-offers bargaining and multi-issue logrolling, from first principles. Part 1 derives and plays the Rubinstein equilibrium; Part 2 contrasts a naive even split against the integrative trade that gives each issue to the agent who values it most.

=== Single-issue alternating-offers bargaining (Rubinstein) ===
discount factors: buyer delta_b=0.9, seller delta_s=0.8
SPE shares (buyer proposes first): buyer=0.7143, seller=0.2857
round  proposer  buyer_share  seller_share  agreement
   1    buyer      0.7143       0.2857       YES

=== Multi-issue negotiation: distributive vs integrative ===
weights A: {'budget': 1, 'deadline': 6, 'scope': 3}
weights B: {'budget': 6, 'deadline': 1, 'scope': 3}
even 5/5 split  -> u_A=50, u_B=50  (sum=100)
logrolling deal -> A gets {'budget': 0, 'deadline': 10, 'scope': 5}
                   u_A=75, u_B=75  (sum=150)
both above reservation 60/60: A True, B True
joint surplus created by exploiting different priorities: +50

Output 29.6.1: The single-issue game closes in round 1 with the patient buyer taking $0.714$, matching the formula. The multi-issue trade lifts both agents from utility $50$ to $75$, a joint surplus of $+50$ created purely by routing each issue to whoever values it more.

The two halves of the output make the section's two main points concrete. The single-issue game confirms the equilibrium: the more patient buyer secures roughly $0.71$ of the pie, and the deal closes immediately because the threat of costly delay is enough. The multi-issue result is the more surprising one. Both agents end strictly better off than under the even split, and neither gave up anything it cared about. That $+50$ of joint surplus was always available; it took a negotiation that looked at all the issues together, rather than haggling each one in isolation, to find it.

Fun Note: The Pie That Grows When You Stop Cutting It

The everyday metaphor for negotiation is dividing a pie, which makes it sound zero-sum by construction. Multi-issue bargaining quietly breaks the metaphor. When the two of you value the filling and the crust differently, handing each person the part they prefer leaves more total pie than any straight cut down the middle. The pie did not grow; you just stopped insisting that fairness means everyone gets the same slice of every ingredient.

5. Argumentation-Based Negotiation Advanced

The protocols above exchange only offers: a number, or a bundle of numbers. Agents reveal nothing about why they want what they want. Argumentation-based negotiation enriches the protocol so that an agent can attach a justification to a proposal, and can challenge, support, or rebut the justifications of others. Instead of merely countering "I offer 0.55" with "I demand 0.70," an agent can say "I cannot go below 0.65 because my supplier costs rose," and the other can accept the premise, dispute it, or respond with a reason of its own. This matters for two reasons. First, justifications let agents update each other's beliefs, which can reveal that a proposed deal is infeasible or that an objection rests on a false assumption, collapsing a deadlock that pure offer-exchange would never resolve. Second, arguments can persuade: an agent may change what it is willing to accept once it understands the constraints the other side is under. Argumentation turns negotiation from a sequence of demands into a structured dialogue, at the cost of a much larger and harder-to-analyze message space.

6. Negotiation as Structured Auction, and the LLM Frontier Advanced

Bilateral bargaining is one protocol among many, and when many agents negotiate over the same object at once, the natural protocol is an auction. An auction is simply a negotiation protocol with very tight rules: the auctioneer fixes who may bid, when, and how the winner and price are determined, which removes the open-ended back-and-forth and replaces it with a single structured round (or a few). Seen this way, the mechanism-design results of Section 28.6 are the theory of how to choose those rules so that the agents' best strategies yield an efficient or revenue-maximizing outcome. Auctions reappear throughout this part as the workhorse for task allocation (Section 29.8), where robots or services bid for jobs, precisely because their tight protocol makes the agents' incentives tractable in a way that free-form bargaining does not.

Research Frontier: LLM Agents at the Bargaining Table (2024 to 2026)

The newest negotiators do not exchange numbers; they exchange sentences. A fast-growing line of work puts large language model agents on both sides of a negotiation and asks whether they reach good and fair deals. Benchmarks such as NegotiationArena (Bianchi et al., 2024) and the Cooperative Embodied Language Agent and related multi-LLM bargaining studies pit models against each other on price-splitting, resource-division, and multi-issue tasks, measuring not just whether a deal is struck but how close it lands to the Pareto frontier and how the surplus is split. The findings are mixed in an instructive way: stronger models extract more surplus and concede more strategically, models can be manipulated by framing and by aggressive opening anchors, and self-play can drift away from human-fair outcomes unless the protocol constrains it. Related agendas study LLMs as automated negotiation assistants that draft offers and justifications for a human principal, and as participants in mechanism-mediated markets where the auction rules of Section 28.6 discipline otherwise free-form natural-language bargaining. The open questions are squarely the ones this section set up: do language-native agents discover the integrative trades that differing priorities make possible, and can a protocol be designed so that fairness and efficiency survive contact with a persuasive counterpart?

Library Shortcut: NegMAS and Genius Instead of Rolling Your Own

Code 29.6.1 implements bargaining from scratch to expose the mechanics. For real automated-negotiation research and competitions you do not hand-code protocols and strategies. NegMAS (Negotiation Multi-Agent System), a Python library, provides ready-made protocols (alternating offers, single and multi-issue, time- and round-based), a catalog of standard strategies (time-dependent concession, behavior-based tit-for-tat, learning agents), built-in utility functions and reservation values, and a tournament runner that scores agents on Pareto-distance and Nash-distance automatically. The Java Genius platform plays the same role for the long-running Automated Negotiating Agents Competition (ANAC). What was dozens of lines of bookkeeping in our demo (turn order, acceptance tests, utility evaluation, Pareto computation) becomes a few lines of configuration: instantiate a mechanism, register two negotiators with their utility functions, and call run(). The library handles the protocol state machine, the strategy interface, and the post-hoc efficiency metrics that a from-scratch script would have to reimplement and verify.

Practical Example: The Procurement Bot That Stopped Leaving Money on the Table

Who: A platform engineer at a manufacturing firm automating supplier negotiations for component orders.

Situation: An existing bot negotiated each contract term separately (unit price first, then lead time, then payment terms), closing thousands of small deals a month.

Problem: Suppliers consistently ended each round near their walk-away point, and post-deal surveys showed both sides felt squeezed, with renegotiations spiking whenever a term changed.

Dilemma: Keep the simple one-issue-at-a-time bot, which was easy to reason about but provably distributive, or move to a multi-issue protocol that could find integrative trades but required modeling each supplier's relative priorities.

Decision: They moved to multi-issue negotiation, because the firm valued lead time far more than a marginal price cut on most parts, while many suppliers valued faster payment over headline price, exactly the asymmetry logrolling exploits.

How: They modeled each issue's weight per supplier from historical contracts, ran an alternating-offers protocol over the full bundle using a NegMAS-style mechanism, and tuned a time-dependent concession strategy against held-out deals before deploying.

Result: Average joint surplus per contract rose measurably (the same shape as the $+50$ in Output 29.6.1), renegotiations fell because both sides landed closer to the Pareto frontier, and supplier-satisfaction scores improved without the firm paying more on aggregate.

Lesson: Negotiating issues one at a time forces a distributive fight and hides the win-win trades. When the parties weigh issues differently, a multi-issue protocol turns the same disagreement into joint value.

We have taken negotiation from its three components, through the sharp equilibrium of two-party bargaining, into the multi-issue regime where differing priorities create value, and out to the frontier where language models bargain in prose. The thread running through all of it is that a protocol can substitute for a central coordinator when objectives conflict. The next step is to let more than two agents combine their interests: when several self-interested agents would do better as a bloc than alone, they must decide who teams up with whom and how to share the spoils. That is coalition formation, the subject of Section 29.7.

Exercise 29.6.1: Reading the Equilibrium Conceptual

Using the proposer-share formula from Section 2, answer the following without running code. (a) If both agents are perfectly patient ($\delta_b = \delta_s = 1$), what does the formula give, and why is that the right intuitive answer? (b) If the seller becomes completely impatient ($\delta_s \to 0$) while the buyer stays patient, who captures almost the entire surplus, and explain the mechanism in terms of continuation values. (c) Explain why the equilibrium has the deal close in round 1 even though the entire prediction rests on what would happen in later rounds that never occur.

Exercise 29.6.2: Reservation Values and the Empty Zone Coding

Extend Part 1 of Code 29.6.1 so each agent has an explicit reservation value (a minimum acceptable share) rather than only a discount factor. Add a check that reports "no deal" when the equilibrium share would leave either agent below its reservation value, that is, when the bargaining zone of Figure 29.6.1 is empty. Then sweep the seller's reservation value upward and find the threshold at which agreement becomes impossible for the discount factors $\delta_b = 0.9, \delta_s = 0.8$. Report the threshold and explain what it represents.

Exercise 29.6.3: How Much Does Logrolling Buy? Analysis

In Part 2 of Code 29.6.1, the joint surplus from logrolling depends on how differently the two agents weight the issues. Make the weights a parameter: let agent A weight the three issues $(1, w, 3)$ and agent B weight them $(w, 1, 3)$ for $w$ ranging over $1, 2, \ldots, 8$. For each $w$, compute the joint surplus of the logrolling deal minus that of the even split, and plot or tabulate it. Characterize how the integrative gain grows with the asymmetry $w$, and identify the value of $w$ at which logrolling provides no benefit over the even split. Relate your finding to the claim that integrative bargaining needs differing priorities to have anything to work with.