Guest User

Untitled

a guest
Jun 10th, 2017
714
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 18.21 KB | None | 0 0
  1. Bitcoin: A Peer-to-Peer Electronic Cash System
  2.  
  3. Satoshi Nakamoto
  4. www.bitcoin.org
  5.  
  6. Abstract. A purely peer-to-peer version of electronic cash would allow online
  7. payments to be sent directly from one party to another without going through a
  8. financial institution. Digital signatures provide part of the solution, but the main
  9. benefits are lost if a trusted third party is still required to prevent double-spending.
  10.  
  11. We propose a solution to the double-spending problem using a peer-to-peer network.
  12. The network timestamps transactions by hashing them into an ongoing chain of
  13. hash-based proof-of-work, forming a record that cannot be changed without redoing
  14. the proof-of-work. The longest chain not only serves as proof of the sequence of
  15. events witnessed, but proof that it came from the largest pool of CPU power. As
  16. long as a majority of CPU power is controlled by nodes that are not cooperating to
  17. attack the network, they'll generate the longest chain and outpace attackers. The
  18. network itself requires minimal structure. Messages are broadcast on a best effort
  19. basis, and nodes can leave and rejoin the network at will, accepting the longest
  20. proof-of-work chain as proof of what happened while they were gone.
  21.  
  22. 1. Introduction
  23.  
  24. Commerce on the Internet has come to rely almost exclusively on financial institutions serving as
  25. trusted third parties to process electronic payments. While the system works well enough for
  26. most transactions, it still suffers from the inherent weaknesses of the trust based model.
  27.  
  28. Completely non-reversible transactions are not really possible, since financial institutions cannot
  29. avoid mediating disputes. The cost of mediation increases transaction costs, limiting the
  30. minimum practical transaction size and cutting off the possibility for small casual transactions,
  31. and there is a broader cost in the loss of ability to make non-reversible payments for nonreversible
  32. services. With the possibility of reversal, the need for trust spreads. Merchants must
  33. be wary of their customers, hassling them for more information than they would otherwise need.
  34.  
  35. A certain percentage of fraud is accepted as unavoidable. These costs and payment uncertainties
  36. can be avoided in person by using physical currency, but no mechanism exists to make payments
  37. over a communications channel without a trusted party.
  38.  
  39. What is needed is an electronic payment system based on cryptographic proof instead of trust,
  40. allowing any two willing parties to transact directly with each other without the need for a trusted
  41. third party. Transactions that are computationally impractical to reverse would protect sellers
  42. from fraud, and routine escrow mechanisms could easily be implemented to protect buyers. In
  43. this paper, we propose a solution to the double-spending problem using a peer-to-peer distributed
  44. timestamp server to generate computational proof of the chronological order of transactions. The
  45. system is secure as long as honest nodes collectively control more CPU power than any
  46. cooperating group of attacker nodes.
  47.  
  48. 1
  49. 2. Transactions
  50.  
  51. We define an electronic coin as a chain of digital signatures. Each owner transfers the coin to the
  52. next by digitally signing a hash of the previous transaction and the public key of the next owner
  53. and adding these to the end of the coin. A payee can verify the signatures to verify the chain of
  54. ownership.
  55.  
  56. The problem of course is the payee can't verify that one of the owners did not double-spend
  57. the coin. A common solution is to introduce a trusted central authority, or mint, that checks every
  58. transaction for double spending. After each transaction, the coin must be returned to the mint to
  59. issue a new coin, and only coins issued directly from the mint are trusted not to be double-spent.
  60. The problem with this solution is that the fate of the entire money system depends on the
  61. company running the mint, with every transaction having to go through them, just like a bank.
  62.  
  63. We need a way for the payee to know that the previous owners did not sign any earlier
  64. transactions. For our purposes, the earliest transaction is the one that counts, so we don't care
  65. about later attempts to double-spend. The only way to confirm the absence of a transaction is to
  66. be aware of all transactions. In the mint based model, the mint was aware of all transactions and
  67. decided which arrived first. To accomplish this without a trusted party, transactions must be
  68. publicly announced [1], and we need a system for participants to agree on a single history of the
  69. order in which they were received. The payee needs proof that at the time of each transaction, the
  70. majority of nodes agreed it was the first received.
  71.  
  72. 3. Timestamp Server
  73.  
  74. The solution we propose begins with a timestamp server. A timestamp server works by taking a
  75. hash of a block of items to be timestamped and widely publishing the hash, such as in a
  76. newspaper or Usenet post [2-5]. The timestamp proves that the data must have existed at the
  77. time, obviously, in order to get into the hash. Each timestamp includes the previous timestamp in
  78. its hash, forming a chain, with each additional timestamp reinforcing the ones before it.
  79.  
  80. 4. Proof-of-Work
  81.  
  82. To implement a distributed timestamp server on a peer-to-peer basis, we will need to use a proofof-work
  83. system similar to Adam Back's Hashcash [6], rather than newspaper or Usenet posts.
  84.  
  85. The proof-of-work involves scanning for a value that when hashed, such as with SHA-256, the
  86. hash begins with a number of zero bits. The average work required is exponential in the number
  87. of zero bits required and can be verified by executing a single hash.
  88.  
  89. For our timestamp network, we implement the proof-of-work by incrementing a nonce in the
  90. block until a value is found that gives the block's hash the required zero bits. Once the CPU
  91. effort has been expended to make it satisfy the proof-of-work, the block cannot be changed
  92. without redoing the work. As later blocks are chained after it, the work to change the block
  93. would include redoing all the blocks after it.
  94.  
  95. The proof-of-work also solves the problem of determining representation in majority decision
  96. making. If the majority were based on one-IP-address-one-vote, it could be subverted by anyone
  97. able to allocate many IPs. Proof-of-work is essentially one-CPU-one-vote. The majority
  98. decision is represented by the longest chain, which has the greatest proof-of-work effort invested
  99. in it. If a majority of CPU power is controlled by honest nodes, the honest chain will grow the
  100. fastest and outpace any competing chains. To modify a past block, an attacker would have to
  101. redo the proof-of-work of the block and all blocks after it and then catch up with and surpass the
  102. work of the honest nodes. We will show later that the probability of a slower attacker catching up
  103. diminishes exponentially as subsequent blocks are added.
  104.  
  105. To compensate for increasing hardware speed and varying interest in running nodes over time,
  106. the proof-of-work difficulty is determined by a moving average targeting an average number of
  107. blocks per hour. If they're generated too fast, the difficulty increases.
  108.  
  109. 5. Network
  110.  
  111. The steps to run the network are as follows:
  112.  
  113. 1) New transactions are broadcast to all nodes.
  114.  
  115. 2) Each node collects new transactions into a block.
  116.  
  117. 3) Each node works on finding a difficult proof-of-work for its block.
  118.  
  119. 4) When a node finds a proof-of-work, it broadcasts the block to all nodes.
  120.  
  121. 5) Nodes accept the block only if all transactions in it are valid and not already spent.
  122.  
  123. 6) Nodes express their acceptance of the block by working on creating the next block in the
  124. chain, using the hash of the accepted block as the previous hash.
  125.  
  126. Nodes always consider the longest chain to be the correct one and will keep working on
  127. extending it. If two nodes broadcast different versions of the next block simultaneously, some
  128. nodes may receive one or the other first. In that case, they work on the first one they received,
  129. but save the other branch in case it becomes longer. The tie will be broken when the next proofof-work
  130. is found and one branch becomes longer; the nodes that were working on the other
  131. branch will then switch to the longer one.
  132.  
  133. New transaction broadcasts do not necessarily need to reach all nodes. As long as they reach
  134. many nodes, they will get into a block before long. Block broadcasts are also tolerant of dropped
  135. messages. If a node does not receive a block, it will request it when it receives the next block and
  136. realizes it missed one.
  137.  
  138. 6. Incentive
  139.  
  140. By convention, the first transaction in a block is a special transaction that starts a new coin owned
  141. by the creator of the block. This adds an incentive for nodes to support the network, and provides
  142. a way to initially distribute coins into circulation, since there is no central authority to issue them.
  143.  
  144. The steady addition of a constant of amount of new coins is analogous to gold miners expending
  145. resources to add gold to circulation. In our case, it is CPU time and electricity that is expended.
  146. The incentive can also be funded with transaction fees. If the output value of a transaction is
  147. less than its input value, the difference is a transaction fee that is added to the incentive value of
  148. the block containing the transaction. Once a predetermined number of coins have entered
  149. circulation, the incentive can transition entirely to transaction fees and be completely inflation
  150. free.
  151.  
  152. The incentive may help encourage nodes to stay honest. If a greedy attacker is able to
  153. assemble more CPU power than all the honest nodes, he would have to choose between using it
  154. to defraud people by stealing back his payments, or using it to generate new coins. He ought to
  155. find it more profitable to play by the rules, such rules that favor him with more new coins than
  156. everyone else combined, than to undermine the system and the validity of his own wealth.
  157.  
  158. 7. Reclaiming Disk Space
  159.  
  160. Once the latest transaction in a coin is buried under enough blocks, the spent transactions before
  161. it can be discarded to save disk space. To facilitate this without breaking the block's hash,
  162. transactions are hashed in a Merkle Tree [7][2][5], with only the root included in the block's hash.
  163. Old blocks can then be compacted by stubbing off branches of the tree. The interior hashes do
  164. not need to be stored.
  165.  
  166. A block header with no transactions would be about 80 bytes. If we suppose blocks are
  167. generated every 10 minutes, 80 bytes * 6 * 24 * 365 = 4.2MB per year. With computer systems
  168. typically selling with 2GB of RAM as of 2008, and Moore's Law predicting current growth of
  169. 1.2GB per year, storage should not be a problem even if the block headers must be kept in
  170. memory.
  171.  
  172. 8. Simplified Payment Verification
  173.  
  174. It is possible to verify payments without running a full network node. A user only needs to keep
  175. a copy of the block headers of the longest proof-of-work chain, which he can get by querying
  176. network nodes until he's convinced he has the longest chain, and obtain the Merkle branch
  177. linking the transaction to the block it's timestamped in. He can't check the transaction for
  178. himself, but by linking it to a place in the chain, he can see that a network node has accepted it,
  179. and blocks added after it further confirm the network has accepted it.
  180.  
  181. As such, the verification is reliable as long as honest nodes control the network, but is more
  182. vulnerable if the network is overpowered by an attacker. While network nodes can verify
  183. transactions for themselves, the simplified method can be fooled by an attacker's fabricated
  184. transactions for as long as the attacker can continue to overpower the network. One strategy to
  185. protect against this would be to accept alerts from network nodes when they detect an invalid
  186. block, prompting the user's software to download the full block and alerted transactions to
  187. confirm the inconsistency. Businesses that receive frequent payments will probably still want to
  188. run their own nodes for more independent security and quicker verification.
  189.  
  190. 9. Combining and Splitting Value
  191.  
  192. Although it would be possible to handle coins individually, it would be unwieldy to make a
  193. separate transaction for every cent in a transfer. To allow value to be split and combined,
  194. transactions contain multiple inputs and outputs. Normally there will be either a single input
  195. from a larger previous transaction or multiple inputs combining smaller amounts, and at most two
  196. outputs: one for the payment, and one returning the change, if any, back to the sender.
  197. It should be noted that fan-out, where a transaction depends on several transactions, and those
  198. transactions depend on many more, is not a problem here. There is never the need to extract a
  199. complete standalone copy of a transaction's history.
  200.  
  201. 10. Privacy
  202.  
  203. The traditional banking model achieves a level of privacy by limiting access to information to the
  204. parties involved and the trusted third party. The necessity to announce all transactions publicly
  205. precludes this method, but privacy can still be maintained by breaking the flow of information in
  206. another place: by keeping public keys anonymous. The public can see that someone is sending
  207. an amount to someone else, but without information linking the transaction to anyone. This is
  208. similar to the level of information released by stock exchanges, where the time and size of
  209. individual trades, the "tape", is made public, but without telling who the parties were.
  210.  
  211. As an additional firewall, a new key pair should be used for each transaction to keep them
  212. from being linked to a common owner. Some linking is still unavoidable with multi-input
  213. transactions, which necessarily reveal that their inputs were owned by the same owner. The risk
  214. is that if the owner of a key is revealed, linking could reveal other transactions that belonged to
  215. the same owner.
  216.  
  217. 11. Calculations
  218.  
  219. We consider the scenario of an attacker trying to generate an alternate chain faster than the honest
  220. chain. Even if this is accomplished, it does not throw the system open to arbitrary changes, such
  221. as creating value out of thin air or taking money that never belonged to the attacker. Nodes are
  222. not going to accept an invalid transaction as payment, and honest nodes will never accept a block
  223. containing them. An attacker can only try to change one of his own transactions to take back
  224. money he recently spent.
  225.  
  226. The race between the honest chain and an attacker chain can be characterized as a Binomial
  227. Random Walk. The success event is the honest chain being extended by one block, increasing its
  228. lead by +1, and the failure event is the attacker's chain being extended by one block, reducing the
  229. gap by -1.
  230.  
  231. The probability of an attacker catching up from a given deficit is analogous to a Gambler's
  232. Ruin problem. Suppose a gambler with unlimited credit starts at a deficit and plays potentially an
  233. infinite number of trials to try to reach breakeven. We can calculate the probability he ever
  234. reaches breakeven, or that an attacker ever catches up with the honest chain, as follows [8]:
  235.  
  236. P equals probability an honest node finds the next block
  237.  
  238. Q equals probability the attacker finds the next block
  239.  
  240. Q Z equals probability the attacker will ever catch up from z blocks behind
  241.  
  242. Q Z equals one if P is less than or equal to Q
  243.  
  244. Q Z equals Q divided by P raised to the Z if P is less than Q
  245.  
  246. Given our assumption that p > q, the probability drops exponentially as the number of blocks the
  247. attacker has to catch up with increases. With the odds against him, if he doesn't make a lucky
  248. lunge forward early on, his chances become vanishingly small as he falls further behind.
  249.  
  250. We now consider how long the recipient of a new transaction needs to wait before being
  251. sufficiently certain the sender can't change the transaction. We assume the sender is an attacker
  252. who wants to make the recipient believe he paid him for a while, then switch it to pay back to
  253. himself after some time has passed. The receiver will be alerted when that happens, but the
  254. sender hopes it will be too late.
  255.  
  256. The receiver generates a new key pair and gives the public key to the sender shortly before
  257. signing. This prevents the sender from preparing a chain of blocks ahead of time by working on
  258. it continuously until he is lucky enough to get far enough ahead, then executing the transaction at
  259. that moment. Once the transaction is sent, the dishonest sender starts working in secret on a
  260. parallel chain containing an alternate version of his transaction.
  261.  
  262. The recipient waits until the transaction has been added to a block and z blocks have been
  263. linked after it. He doesn't know the exact amount of progress the attacker has made, but
  264. assuming the honest blocks took the average expected time per block, the attacker's potential
  265. progress will be a Poisson distribution with expected value:
  266.  
  267. lambda equals Z times Q divided by P
  268.  
  269. To get the probability the attacker could still catch up now, we multiply the Poisson density for
  270. each amount of progress he could have made by the probability he could catch up from that point:
  271.  
  272. MATH
  273.  
  274. Rearranging to avoid summing the infinite tail of the distribution...
  275.  
  276. MATH
  277.  
  278. Converting to C code...
  279.  
  280. CODE
  281.  
  282. Running some results, we can see the probability drop off exponentially with z.
  283.  
  284. 12. Conclusion
  285.  
  286. We have proposed a system for electronic transactions without relying on trust. We started with
  287. the usual framework of coins made from digital signatures, which provides strong control of
  288. ownership, but is incomplete without a way to prevent double-spending. To solve this, we
  289. proposed a peer-to-peer network using proof-of-work to record a public history of transactions
  290. that quickly becomes computationally impractical for an attacker to change if honest nodes
  291. control a majority of CPU power. The network is robust in its unstructured simplicity. Nodes
  292. work all at once with little coordination. They do not need to be identified, since messages are
  293. not routed to any particular place and only need to be delivered on a best effort basis. Nodes can
  294. leave and rejoin the network at will, accepting the proof-of-work chain as proof of what
  295. happened while they were gone. They vote with their CPU power, expressing their acceptance of
  296. valid blocks by working on extending them and rejecting invalid blocks by refusing to work on
  297. them. Any needed rules and incentives can be enforced with this consensus mechanism.
Advertisement
Add Comment
Please, Sign In to add comment