




Game theory is the science of strategy. It attempts to determine mathematically and logically the actions that “players” should take to secure the best outcomes for themselves in a wide array of “games.” The games it studies range from chess to child rearing and from tennis to takeovers. But the games all share the common feature of interdependence. That is, the outcome for each participant depends on the choices (strategies) of all. In so-called zero-sum games the interests of the players conflict totally, so that one person's gain is always another's loss. More typical are games with the potential for either mutual gain (positive sum) or mutual harm (negative sum), as well as some conflict.

Game theory was pioneered by Princeton mathematician John von Neumann. In the early years the emphasis was on games of pure conflict (zero-sum games). Other games were considered in a cooperative form. That is, the participants were supposed to choose and implement their actions jointly. Recent research has focused on games that are neither zero sum nor purely cooperative. In these games the players choose their actions separately, but their links to others involve elements of both competition and cooperation.

Games are fundamentally different from decisions made in a neutral environment. To illustrate the point, think of the difference between the decisions of a lumberjack and those of a general. When the lumberjack decides how to chop wood, he does not expect the wood to fight back; his environment is neutral. But when the general tries to cut down the enemy's army, he must anticipate and overcome resistance to his plans.
Like the general, a game player must recognize his interaction with other intelligent and purposive people. His own choice must allow both for conflict and for possibilities for cooperation.

The essence of a game is the interdependence of player strategies. There are two distinct types of strategic interdependence: sequential and simultaneous. In the former the players move in sequence, each aware of the others' previous actions. In the latter the players act at the same time, each ignorant of the others' actions.

A general principle for a player in a sequential-move game is to look ahead and reason back. Each player should figure out how the other players will respond to his current move, how he will respond in turn, and so on. The player anticipates where his initial decisions will ultimately lead and uses this information to calculate his current best choice. When thinking about how others will respond, he must put himself in their shoes and think as they would; he should not impose his own reasoning on them.

In principle, any sequential game that ends after a finite sequence of moves can be “solved” completely. We determine each player's best strategy by looking ahead to every possible outcome. Simple games, such as tic-tac-toe, can be solved in this way and are therefore not challenging. For many other games, such as chess, the calculations are too complex to perform in practice, even with computers.
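The principle of looking ahead and reasoning back can be sketched in a few lines of Python. What follows is only an illustration under stated assumptions: the function name `reason_back`, the dictionary layout of the game tree, and the small two-player `entry_game` with its payoff numbers are all hypothetical and do not come from the text above.

```python
def reason_back(node):
    """Solve a finite sequential game by looking ahead and reasoning back.

    Assumed tree layout (hypothetical, for illustration only):
      leaf:          {"payoffs": (p0, p1, ...)}   one payoff per player
      decision node: {"player": i, "moves": {label: child_node}}
    Returns (payoffs, path), where path lists the moves chosen along the way.
    """
    if "payoffs" in node:
        return node["payoffs"], []
    mover = node["player"]
    best_payoffs, best_path = None, None
    for label, child in node["moves"].items():
        # Look ahead: find out where this move ultimately leads.
        payoffs, path = reason_back(child)
        # Reason back: the mover keeps the move with his own highest payoff.
        # Ties go to the move listed first.
        if best_payoffs is None or payoffs[mover] > best_payoffs[mover]:
            best_payoffs, best_path = payoffs, [label] + path
    return best_payoffs, best_path


# Hypothetical game: player 0 decides whether to enter a market,
# then player 1 decides whether to fight or accommodate.
entry_game = {
    "player": 0,
    "moves": {
        "stay out": {"payoffs": (0, 2)},
        "enter": {
            "player": 1,
            "moves": {
                "fight": {"payoffs": (-1, -1)},
                "accommodate": {"payoffs": (1, 1)},
            },
        },
    },
}

payoffs, path = reason_back(entry_game)
print(path, payoffs)  # ['enter', 'accommodate'] (1, 1)
```

In this assumed game, player 0 enters because he anticipates that player 1, once faced with entry, prefers to accommodate. The function visits every leaf of the tree, which is why the same approach is feasible for tic-tac-toe but not for chess.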
Therefore, in games such as chess, the players look a few moves ahead and try to evaluate the resulting positions on the basis of experience.

In contrast to the linear chain of reasoning for sequential games, a game with simultaneous moves involves a logical circle. Although the players act at the same time, in ignorance of the others' current actions, each must be aware that there are other players who are similarly aware, and so on. The thinking goes: “I think that he thinks that I think . . .” Therefore, each must figuratively put himself in the shoes of all and try to calculate the outcome. His own best action is an integral part of this overall calculation.

This logical circle is squared (the circular reasoning is brought to a conclusion) using a concept of equilibrium developed by the Princeton mathematician John Nash. We look for a set of choices, one for each player, such that each person's strategy is best for him when all others are playing their stipulated best strategies. In other words, each picks his best response to what the others do.

Sometimes one person's best choice is the same no matter what the others do. This is called a “dominant strategy” for that player. At other times, one player has a uniformly bad choice, a “dominated strategy,” in the sense that some other choice is better for him no matter what the others do.
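The equilibrium and dominant-strategy definitions above can be checked mechanically for a small two-player game. The sketch below is a minimal illustration, not a general solver: the function names, the payoff-table layout, and the payoff numbers of the hypothetical two-suspect game are all assumptions. It looks only at pure (non-randomized) choices.

```python
from itertools import product


def pure_nash_equilibria(payoffs):
    """List strategy pairs in which each player best-responds to the other.

    payoffs[(row_choice, col_choice)] = (row player's payoff, column player's payoff)
    """
    rows = sorted({r for r, _ in payoffs})
    cols = sorted({c for _, c in payoffs})
    equilibria = []
    for r, c in product(rows, cols):
        # Row player cannot gain by switching rows while the column stays fixed.
        row_best = all(payoffs[(r, c)][0] >= payoffs[(alt, c)][0] for alt in rows)
        # Column player cannot gain by switching columns while the row stays fixed.
        col_best = all(payoffs[(r, c)][1] >= payoffs[(r, alt)][1] for alt in cols)
        if row_best and col_best:
            equilibria.append((r, c))
    return equilibria


def dominant_strategy(payoffs, player):
    """Return the choice that is strictly best for `player` (0 = row, 1 = column)
    against every choice of the other player, or None if no such choice exists."""
    own = sorted({key[player] for key in payoffs})
    other = sorted({key[1 - player] for key in payoffs})

    def payoff(mine, theirs):
        key = (mine, theirs) if player == 0 else (theirs, mine)
        return payoffs[key][player]

    for s in own:
        if all(payoff(s, t) > payoff(alt, t)
               for alt in own if alt != s
               for t in other):
            return s
    return None


# Hypothetical payoffs (negative years in prison) for two suspects.
game = {
    ("silent", "silent"): (-1, -1),
    ("silent", "confess"): (-10, 0),
    ("confess", "silent"): (0, -10),
    ("confess", "confess"): (-5, -5),
}

print(dominant_strategy(game, 0))   # confess
print(dominant_strategy(game, 1))   # confess
print(pure_nash_equilibria(game))   # [('confess', 'confess')]
```

With these assumed numbers, confessing is dominant for both players, so the only equilibrium is the pair in which both confess, even though both would be better off if both stayed silent.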
The search for an equilibrium should begin by looking for dominant strategies and eliminating dominated ones.

When we say that an outcome is an equilibrium, there is no presumption that each person's privately best choice will lead to a collectively optimal result. Indeed, there are notorious examples, such as the prisoners' dilemma (see below), where the players are drawn into a bad outcome by each following his best private interests.

Nash's notion of equilibrium remains an incomplete solution to the problem of circular reasoning in simultaneous-move games. Some games have many such equilibria while others have none. And the dynamic process that can lead to an equilibrium is left unspecified. But in spite of these flaws, the concept has proved extremely useful in analyzing many strategic interactions.

It is often thought that the application of game theory requires all players to be hyperrational. The theory makes no such claims. Players may be spiteful or envious as well as charitable and empathetic. Recall George Bernard Shaw's amendment to the Golden Rule: “Do not do unto others as you would have them do unto you. Their tastes may be different.” In addition to different motivations, other players may have different information.
When calculating an equilibrium or anticipating the response to your move, you always have to take the other players as they are, not as you are.

The following examples of strategic interaction illustrate some of the fundamentals of game theory.

The prisoners' dilemma. Two suspects are questioned separately, and each can confess or keep silent. If suspect A keeps silent, then suspect B can get a better deal by confessing. If A confesses, B had better confess to avoid especially harsh treatment. Confession is B's dominant strategy. The same is true for A. Therefore, in equilibrium both confess. Both would fare better if they both stayed silent. Such cooperative behavior can be achieved in repeated plays of the game because the temporary gain from cheating (confession) can be outweighed by the long-run loss due to the breakdown of cooperation. Strategies such as tit-for-tat are suggested in this context.

Mixing moves. In some situations of conflict, any systematic action will be discovered and exploited by the rival. Therefore, it is important to keep the rival guessing by mixing your moves. Typical examples arise in sports: whether to run or to pass in a particular situation in football, or whether to hit a passing shot crosscourt or down the line in tennis. Game theory quantifies this insight and details the right proportions of such mixtures.

Strategic moves.
A player can use threats and promises to alter other players' expectations of his future actions, and thereby induce them to take actions favorable to him or deter them from making moves that harm him. To succeed, the threats and promises must be credible. This is problematic because when the time comes, it is generally costly to carry out a threat or make good on a promise. Game theory studies several ways to enhance credibility. The general principle is that it can be in a player's interest to reduce his own freedom of future action. By so doing, he removes his own temptation to renege on a promise or to forgive others' transgressions.

For example, Cortés scuttled all but one of his own ships on his arrival in Mexico, purposefully eliminating retreat as an option. Without ships to sail home, Cortés would either succeed in his conquest or perish. Although his soldiers were vastly outnumbered, this threat to fight to the death demoralized the opposition, who chose to retreat rather than fight such a determined opponent. Polaroid Corporation used a similar strategy when it purposefully refused to diversify out of the instant photography market. It was committed to a life-or-death battle against any intruder in the market. When Kodak entered the instant photography market, Polaroid put all its resources into the fight; fourteen years later, Polaroid won a nearly billion-dollar lawsuit against Kodak and regained its monopoly market.
(Polaroid's focus on instant film products later proved costly when the company failed to diversify into digital photography.)

Another way to make threats credible is to employ the adventuresome strategy of brinkmanship: deliberately creating a risk that if other players fail to act as you would like them to, the outcome will be bad for everyone. Introduced by Thomas Schelling in The Strategy of Conflict, brinkmanship “is the tactic of deliberately letting the situation get somewhat out of hand, just because its being out of hand may be intolerable to the other party and force his accommodation.” When mass demonstrators confronted totalitarian governments in Eastern Europe and China, both sides were engaging in just such a strategy. Sometimes one side backs down and concedes defeat; sometimes tragedy results when they fall over the brink together.

Bargaining. Two players decide how to split a pie. Each wants a larger share, and both prefer to achieve agreement sooner rather than later. When the two take turns making offers, the principle of looking ahead and reasoning back determines the equilibrium shares. Agreement is reached at once, but the cost of delay governs the shares. The player more impatient to reach agreement gets a smaller share.

Concealing and rev