Jump to content

General game playing

From Wikipedia, the free encyclopedia

General game playing (GGP) is the design of artificial intelligence programs to be able to play more than one game successfully.[1][2][3] For many games like chess, computers are programmed to play these games using a specially designed algorithm, which cannot be transferred to another context. For instance, a chess-playing computer program cannot play checkers. General game playing is considered as a necessary milestone on the way to artificial general intelligence.[4]

General game playing algorithms are typically restrained to games with finite possible moves that take place synchronously between players. These games are described in terms of a start state, possible intermediary states, and a goal.

The first commercial usage of general game playing technology was Zillions of Games in 1998. General game playing was also proposed for trading agents in supply chain management there under price negotiation in online auctions from 2003 onwards.[5][6][7][8]

History

[edit]

Origins (1958–1990s)

[edit]

Researchers at the RAND Corporation first described a General Problem Solver, which they called "General Problem Solving Program I" (GPS), in 1958. The goal of their project was to solve problems, which they defined broadly as "whenever a problem solver desires some outcome or state of affairs that he does not immediately know how to attain." The RAND group emphasized the importance of heuristic methods in problem solving, and described a theoretically ideal problem-solving process as a cycle of discovery, information, and conclusion through educated guesswork.[9]

RAND's GPS algorithm grew out of the more specialized "Logic Theorist" algorithm, which produced proofs in propositional logic. Like the Logic Theorist, GPS relied on the creation of intermediary goals through means–ends analysis. In particular, the algorithm was capable of solving Symbolic Logic problems by using heuristic methods which the researchers derived from the problem solving strategies of actual human test subjects. Central to the model's mechanisms for simplification was the method of abstraction, in which the logical elements were replaced by more basic elements which functioned similarly in order to simplify problems.[9] Although the algorithm is theoretically capable of solving any sufficiently formalized problem in the form of well-formed formulae and directed graphs, in practice the amount of computation required for the algorithm to adequately explore the graph is infeasible to implement due to the combinatorial explosion.

In 1968, Jacques Pitrat founded the field of General Game Playing in a paper in the proceedings of the International Federation for Information Processing.[10] Pitrat later worked on the more specialized problem of chess-playing algorithms, but his earliest contribution to game playing algorithm development was to separate the subclass of general game players from the broader space of general problem solvers. Pitrat's game playing algorithm was theoretically capable of playing chess-like games, which could be described using a simple set of rule variations.

Formalization and competition (1992–2016)

[edit]

In 1992, Barney Pell defined the concept of Meta-Game Playing and developed the "MetaGame" system. This was the first program to automatically generate chess-like game rules, and one of the earliest programs to use automated game generation. Pell then developed the system Metagamer.[11] This system was able to play a number of chess-like games without any human interaction once the games were generated.[12]

In 1998, the commercial system Zillions of Games was developed by Jeff Mallett and Mark Lefler. The system used a LISP-like language to define the game rules. Zillions of Games derived the evaluation function automatically from the game rules based on piece mobility, board structure and game goals. It also employed usual algorithms as found in computer chess systems: alpha–beta pruning with move ordering, transposition tables, etc.[13] The package was extended in 2007 by the addition of the Axiom plug-in, an alternate metagame engine that incorporates a complete Forth-based programming language.

In 1998, z-Tree was developed by Urs Fischbacher.[14] z-Tree is the first and the most cited software tool for experimental economics. z-Tree allows the definition of game rules in z-Tree-language for game-theoretic experiments with human subjects. It also allows definition of computer players, which participate in a play with human subjects.[15]

In 2005, the Stanford Project General Game Playing was established.[16] This included the General Game Playing Competition, which was launched at the associated AAAI conference. The competition ran annually and further establish GGP as a research field. It was suspended after 2016, by which point it was considered to have largely served its original purpose of stimulating increased research in the area.[17] The Stanford project also produced Game Description Language (GDL), which allowed for the production and description of game variations and was used extensively in the General Game Playing Competition.[18]

In 2012, the development of PyVGDL (Python Video Game Description Language) started, and has now migrated to PyVGDL 2.0.[19][20] PyVGDL is a library made for the programming language Python that functions as a game engine, enabling the integration of various AI applications in tandem with simple game rules to create complex games. The library is frequently used in artificial intelligence research centered around General Game Playing problems due to its dynamic capabilities.[21]

Deep reinforcement learning

[edit]

In 2018, Google DeepMind released a paper describing their new AlphaZero algorithm.[22] The algorithm built on their earlier game-specific work with AlphaGo, and used a Monte Carlo tree search algorithm. AlphaZero met and even surpassed the performance of specialized game playing algorithms, and reviewers characterized it as a notable step toward general game-playing systems.[23][24]

In 2020, the DeepMind group that had produced AlphaZero published their work on the MuZero algorithm.[25]While AlphaZero required knowledge of the rules of games before playing, MuZero was able to match its performance without being given any prior information on the underlying structure of the game it was playing. The authors of the original MuZero paper noted that the lack of a need for pre-set rules potentially made the algorithm more applicable to use in real-world situations, in which there are no known governing principles or underlying rules.[25]

New description languages (2019-2020)

[edit]

Later research efforts produced new game-description languages which addressed the limitations of GDL. In 2019, researchers at the University of Wrocław created Regular Boardgames (RBG), a computer language based on the theory of regular languages. RBG encodes board game rules as regular expressions, and was designed for expressiveness, naturalness, and efficiency. Although built for usage in board games specifically, it is universally capable of describing any finite deterministic turn-based game with perfect information.[26] The Wrocław group's paper reported that it was the first GGP language able to efficiently encode and play games with complex rules and large numbers of legal moves in any given turn, such as Arimaa.[26] In 2020, Éric Piette, Cameron Browne and colleagues introduced Ludii, a general game system built on "ludemes" — units of game-related information — as part of the Digital Ludeme Project.[27] Ludii was designed to model a wide range of games, including ones which are no longer actively played, and to study their structure and history. Ludii matched the performance of previous descriptive schemes such as RBG and, because of its high level structure, produced game descriptions that were generally more human-readable compared to other systems.[27]

Large Language Models (2020s)

[edit]

In the early 2020s, researchers started evaluating large language models (LLMs) as general game players, testing their ability to reason strategically and make choices in games they were not specifically trained on. In 2024, Oguzhan Topsakal and colleagues introduced an open-source benchmark which could be used to compare the performance of LLMs in grid-based games such as Tic-Tac-Toe, Connect Four, and Gomoku. This benchmark made use of several different methods of describing game states to the models, comparing performance across methodologies.[28] In 2025, a team from the University of California, San Diego, Mohamed bin Zayed University of Artificial Intelligence, and the University of California, Berkeley introduced lmgame-Bench, which evaluated models' performance across a wide range of video game types.[29] This work emphasized that directly testing LLMs on games is difficult, identifying problems such as brittle visual perception, sensitivity to prompt wording, and the presence of preexisting gameplay strategy in training data.

GGP implementations

[edit]

Stanford project

[edit]

General Game Playing is a project of the Stanford Logic Group of Stanford University, California, which aims to create a platform for general game playing. It is the most well-known effort at standardizing GGP AI, and generally seen as the standard for GGP systems. The games are defined by sets of rules represented in the Game Description Language. In order to play the games, players interact with a game hosting server[30][31] that monitors moves for legality and keeps players informed of state changes.

From 2005 to 2016, annual General Game Playing competitions were held at the AAAI Conference. The competition judged competitor AI's abilities to play a variety of different games by recording their performance on each individual game. In the first stage of the competition, entrants were judged on their ability to perform legal moves, gain the upper hand, and complete games faster. In the following runoff round, the AIs faced off against each other in increasingly complex games. The AI that won the most games at this stage won the competition, and until 2013 its creator used to win a $10,000 prize.[32] The following programs were victorious:[33]

Year Name Developer Institution Ref
2005 Cluneplayer Jim Clune UCLA
2006 Fluxplayer Stephan Schiffel and Michael Thielscher Dresden University of Technology [34]
2007 Cadiaplayer Yngvi Björnsson and Hilmar Finnsson Reykjavik University [35]
2008 Cadiaplayer Yngvi Björnsson, Hilmar Finnsson and Gylfi Þór Guðmundsson Reykjavik University
2009 Ary Jean Méhat Paris 8 University
2010 Ary Jean Méhat Paris 8 University
2011 TurboTurtle Sam Schreiber
2012 Cadiaplayer Hilmar Finnsson and Yngvi Björnsson Reykjavik University
2013 TurboTurtle Sam Schreiber
2014 Sancho Steve Draper and Andrew Rose [36]
2015 Galvanise Richard Emslie
2016 WoodStock Eric Piette Artois University

Other approaches

[edit]

Other general game playing software that use their own languages for defining game rules include:

System Year Description
FRAMASI 2009 Developed for general game playing and economic experiments by Rustam Tagiew.[37][38]
AiAi 2015-2017 Developed by Stephen Tavener (previous Zillions developer).[39][40][41]
PolyGamo Player 2017 Released by David M. Bennett in September 2017 based on the Unity game engine.[42]
Regular Boardgames 2019 Developed by Jakub Kowalski, Marek Szykuła, and their team at University of Wrocław.[43][44]
Ludii 2020 Released by Cameron Browne and his team at Maastricht University as part of the ERC-funded Digital Ludeme Project.[45][46][47]

Algorithms

[edit]

Since GGP AI must be designed to play multiple games, its design cannot rely on algorithms created specifically for certain games. Instead, the AI must be designed using algorithms whose methods can be applied to a wide range of games. Recent GGP systems such as Regular Boardgames (RBG) and Ludii have explored alternative rule representations to optimize reasoning efficiency and support a broader variety of games. The AI must also be an ongoing process, that can adapt to its current state rather than the output of previous states. For this reason, open loop techniques are often most effective.[48] The Ludii software was used to find how the ancient Roman board game Ludus Coriovalli may have been played by referencing many similar games and using those to play possible rulesets. It was determined that it was most likely a two-player blocking game. Previously, the oldest known examples of such dated to medieval times.[49][50]

A popular method for developing GGP AI is the Monte Carlo tree search (MCTS) algorithm.[51] Often used together with the UCT method (Upper Confidence Bound applied to Trees), variations of MCTS have been proposed to better play certain games, as well as to make it compatible with video game playing.[52][53][54] Another variation of tree-search algorithms used is the Directed Breadth-first Search (DBS),[55] in which a child node to the current state is created for each available action, and visits each child ordered by highest average reward, until either the game ends or runs out of time.[56] In each tree-search method, the AI simulates potential actions and ranks each based on the average highest reward of each path, in terms of points earned.[51][56]

Assumptions

[edit]

In order to interact with games, algorithms must operate under the assumption that games all share common characteristics. In the book Half-Real: Video Games Between Real Worlds and Fictional Worlds, Jesper Juul gives the following definition of games: Games are based on rules, they have variable outcomes, different outcomes give different values, player effort influences outcomes, the player is attached to the outcomes, and the game has negotiable consequences.[57] Using these assumptions, game playing AI can be created by quantifying the player input, the game outcomes, and how the various rules apply, and using algorithms to compute the most favorable path.[58]

General video game playing

[edit]

General video game playing (GVGP) is the concept of GGP adjusted to the purpose of playing video games. For video games, game rules have to be either learnt over multiple iterations by artificial players like TD-Gammon,[59] or are predefined manually in a domain-specific language and sent in advance to artificial players[60][61] like in traditional GGP. Starting in 2013, significant progress was made following the deep reinforcement learning approach, including the development of programs that can learn to play Atari 2600 games[62][59][63][64][65] as well as a program that can learn to play Nintendo Entertainment System games.[66][67][68]

GVGP implementations

[edit]

Reinforcement learning

[edit]

GVGP could potentially be used to create real video game AI automatically, as well as "to test game environments, including those created automatically using procedural content generation and to find potential loopholes in the gameplay that a human player could exploit".[61] GVGP has also been used to generate game rules, and estimate a game's quality based on Relative Algorithm Performance Profiles (RAPP), which compare the skill differentiation that a game allows between good AI and bad AI.[69]

Video Game Description Language

[edit]

The General Video Game AI Competition (GVGAI) has been running since 2014. In this competition, two-dimensional video games similar to (and sometimes based on) 1980s-era arcade and console games are used instead of the board games used in the GGP competition. It has offered a way for researchers and practitioners to test and compare their best general video game playing algorithms. The competition has an associated software framework including a large number of games written in the Video Game Description Language (VGDL), which should not be confused with GDL and is a coding language using simple semantics and commands that can easily be parsed. One example for VGDL is PyVGDL developed in 2013.[60][70] The games used in GVGP are, for now, often 2-dimensional arcade games, as they are the simplest and easiest to quantify.[58] To simplify the process of creating an AI that can interpret video games, games for this purpose are written in VGDL manually.[clarification needed] VGDL can be used to describe a game specifically for procedural generation of levels, using Answer Set Programming (ASP) and an Evolutionary Algorithm (EA). GVGP can then be used to test the validity of procedural levels, as well as the difficulty or quality of levels based on how an agent performed.[71]

See also

[edit]

References

[edit]
  1. ^ Pell, Barney (1992). H. van den Herik; L. Allis (eds.). "Metagame: a new challenge for games and learning" [Heuristic programming in artificial intelligence 3–the third computerolympiad] (PDF). Ellis-Horwood. Archived (PDF) from the original on 2020-02-17. Retrieved 2020-02-17.
  2. ^ Pell, Barney (1996). "A Strategic Metagame Player for General Chess-Like Games". Computational Intelligence. 12 (1): 177–198. doi:10.1111/j.1467-8640.1996.tb00258.x. ISSN 1467-8640. S2CID 996006.
  3. ^ Genesereth, Michael; Love, Nathaniel; Pell, Barney (15 June 2005). "General Game Playing: Overview of the AAAI Competition". AI Magazine. 26 (2): 62. doi:10.1609/aimag.v26i2.1813. ISSN 2371-9621.
  4. ^ Canaan, Rodrigo; Salge, Christoph; Togelius, Julian; Nealen, Andy (2019). Proceedings of the 14th International Conference on the Foundations of Digital Games [Proceedings of the 14th International Conference on the Leveling the playing field: fairness in AI versus human game benchmarks]. pp. 1–8. doi:10.1145/3337722. ISBN 9781450372176. S2CID 58599284.
  5. ^ McMillen, Colin (2003). Toward the Development of an Intelligent Agent for the Supply Chain Management Game of the 2003 Trading Agent Competition [2003 Trading Agent Competition] (Thesis). Master's Thesis. Minneapolis, MN: University of Minnesota. S2CID 167336006.
  6. ^ Zhang, Dongmo (2009). From general game descriptions to a market specification language for general trading agents [Agent-mediated electronic commerce. Designing trading strategies and mechanisms for electronic markets.]. Berlin, Heidelberg: Springer. pp. 259–274. Bibcode:2010aecd.book..259T. CiteSeerX 10.1.1.467.4629.
  7. ^ "AGAPE - An Auction LanGuage for GenerAl Auction PlayErs". AGAPE (in French). 8 March 2019. Archived from the original on 2 August 2021. Retrieved 5 March 2020.
  8. ^ Michael, Friedrich; Ignatov, Dmitry (2019). "General Game Playing B-to-B Price Negotiations" (PDF). CEUR Workshop Proceedings. -2479: 89–99. Archived (PDF) from the original on 6 December 2019. Retrieved 5 March 2020.
  9. ^ a b Genesereth, Michael (2026-04-01). "General Game Playing". Communications of the ACM. 69 (4): 72–79. doi:10.1145/3747852. ISSN 0001-0782.
  10. ^ "Sage Journals: Discover world-class research". Sage Journals. doi:10.3233/ICG-190142. Retrieved 2026-05-29.
  11. ^ Barney Pell's research on computer game playing Archived 2007-08-12 at the Wayback Machine.
  12. ^ "Metagame and General Game Playing". Metagame and General Game Playing. Archived from the original on 3 March 2001. Retrieved 27 March 2016.
  13. ^ Available: Universal Game Engine Archived 2012-11-03 at the Wayback Machine email to comp.ai.games by Jeff Mallett, 10-Dec-1998.
  14. ^ "UZH - z-Tree - Zurich Toolbox for Readymade Economic Experiments". www.ztree.uzh.ch. Archived from the original on 21 February 2016. Retrieved 17 February 2020.
  15. ^ Beckenkamp, Martin; Hennig-Schmidt, Heike; Maier-Rigaud, Frank P. (1 March 2007). "Cooperation in Symmetric and Asymmetric Prisoner's Dilemma Games". Social Science Research Network. SSRN 968942.
  16. ^ Genesereth, Michael; Love, Nathaniel; Pell, Barney (15 June 2005). "General Game Playing: Overview of the AAAI Competition". AI Magazine. 26 (2): 62. doi:10.1609/aimag.v26i2.1813. ISSN 2371-9621.
  17. ^ Genesereth, Michael; Björnsson, Yngvi (June 2013). "The International General Game Playing Competition". AI Magazine. 34 (2): 107–111. doi:10.1609/aimag.v34i2.2475. ISSN 0738-4602.
  18. ^ http://ggp.stanford.edu/overview/ggp.pdf
  19. ^ Schaul, Tom (7 February 2020). "schaul/py-vgdl". GitHub. Archived from the original on 11 June 2018. Retrieved 9 February 2020.
  20. ^ Vereecken, Ruben (2026-05-04), rubenvereecken/py-vgdl, retrieved 2026-05-29
  21. ^ Schaul, Tom (2013-08-17). "A video game description language for model-based or interactive learning". 2013 IEEE Conference on Computational Inteligence in Games (CIG): 1–8. doi:10.1109/CIG.2013.6633610.
  22. ^ "Science". AAAS. doi:10.1126/science.aar6404. Retrieved 2026-05-29.
  23. ^ "Science". AAAS. doi:10.1126/science.aav1175. Retrieved 2026-05-29.
  24. ^ "Science". AAAS. doi:10.1126/science.aar6404. Retrieved 2026-05-29.
  25. ^ a b Schrittwieser, Julian; Antonoglou, Ioannis; Hubert, Thomas; Simonyan, Karen; Sifre, Laurent; Schmitt, Simon; Guez, Arthur; Lockhart, Edward; Hassabis, Demis; Graepel, Thore; Lillicrap, Timothy; Silver, David (2020-12-24). "Mastering Atari, Go, chess and shogi by planning with a learned model". Nature. 588 (7839): 604–609. doi:10.1038/s41586-020-03051-4. ISSN 0028-0836.
  26. ^ a b Kowalski, Jakub; Mika, Maksymilian; Sutowicz, Jakub; Szykuła, Marek (2019-07-17). "Regular Boardgames". Proceedings of the AAAI Conference on Artificial Intelligence. 33 (01): 1699–1706. doi:10.1609/aaai.v33i01.33011699. ISSN 2374-3468.
  27. ^ a b Piette, Éric; Soemers, Dennis J. N. J.; Stephenson, Matthew; Sironi, Chiara F.; Winands, Mark H. M.; Browne, Cameron (2020-02-21), Ludii -- The Ludemic General Game System, arXiv, doi:10.48550/arXiv.1905.05013, arXiv:1905.05013, retrieved 2026-05-29
  28. ^ Topsakal, Oguzhan; Edell, Colby Jacob; Harper, Jackson Bailey (2024-07-11), Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard, arXiv, doi:10.48550/arXiv.2407.07796, arXiv:2407.07796, retrieved 2026-05-29
  29. ^ Hu, Lanxiang; Huo, Mingjia; Zhang, Yuxuan; Yu, Haoyang; Xing, Eric P.; Stoica, Ion; Rosing, Tajana; Jin, Haojian; Zhang, Hao (2025-06-03), lmgame-Bench: How Good are LLMs at Playing Games?, arXiv, doi:10.48550/arXiv.2505.15146, arXiv:2505.15146, retrieved 2026-05-29
  30. ^ GGP Server Archived 2014-02-21 at the Wayback Machine, platform for competition of general game playing systems.
  31. ^ Dresden GGP Server Archived 2013-04-07 at the Wayback Machine, platform for competition of general game playing systems with automatic scheduling of matches.
  32. ^ Barney Pell's research on computer game playing Archived 2007-08-12 at the Wayback Machine.
  33. ^ "General Game Playing". www.general-game-playing.de. Archived from the original on 2008-12-26. Retrieved 2008-08-21.
  34. ^ Information about Fluxplayer Archived 2011-07-19 at the Wayback Machine, the winner of the 2nd International General Game Playing competition.
  35. ^ Information about CADIAPlayer Archived 2011-07-22 at the Wayback Machine, more information about the winner of the 3rd, 4th, and 8th International General Game Playing competitions.
  36. ^ Sancho is GGP Champion 2014! Archived 2015-12-22 at the Wayback Machine, winner of the 2014 International General Game Playing competition.
  37. ^ Tagiew, Rustam (2009). Filipe, Joaquim; Fred, Ana; Sharp, Bernadette (eds.). Towards a framework for management of strategic interaction [Proceedings of the International Conference on Agents and Artificial Intelligence] (PDF). Porto, Portugal. pp. 587–590. ISBN 978-989-8111-66-1. Archived (PDF) from the original on 2021-03-09. Retrieved 2021-06-02.{{cite book}}: CS1 maint: location missing publisher (link)
  38. ^ Tagiew, Rustam (2011). Strategische Interaktion realer Agenten Ganzheitliche Konzeptualisierung und Softwarekomponenten einer interdisziplinären Forschungsinfrastruktur (neue Ausg ed.). Saarbrücken. ISBN 9783838125121.{{cite book}}: CS1 maint: location missing publisher (link)
  39. ^ "Zillions of Games - Who Are We?". www.zillions-of-games.com. Archived from the original on 2017-11-15. Retrieved 2017-11-16.
  40. ^ "AiAi Home Page – Stephen Tavener". mrraow.com. Archived from the original on 2015-09-06. Retrieved 2017-11-16.
  41. ^ "Ai Ai announcement thread". BoardGameGeek. Archived from the original on 2017-11-16. Retrieved 2017-11-16.
  42. ^ "The PolyGamo Player Project | Programming Languages and General Players for Abstract Games and Puzzles". www.polyomino.com. Archived from the original on 2002-09-23. Retrieved 2017-11-16.
  43. ^ Kowalski, Jakub; Mika, Maksymilian; Sutowicz, Jakub; Szykuła, Marek (2019-07-17). "Regular Boardgames". Proceedings of the AAAI Conference on Artificial Intelligence. 33 (1): 1699–1706. doi:10.1609/aaai.v33i01.33011699. ISSN 2374-3468. S2CID 20296467.
  44. ^ Kowalski, Jakub; Miernik, Radoslaw; Mika, Maksymilian; Pawlik, Wojciech; Sutowicz, Jakub; Szykula, Marek; Tkaczyk, Andrzej (2020). "Efficient Reasoning in Regular Boardgames". 2020 IEEE Conference on Games (CoG). pp. 455–462. arXiv:2006.08295. doi:10.1109/cog47356.2020.9231668. ISBN 978-1-7281-4533-4. S2CID 219687404.
  45. ^ "Ludii Portal | Home of the Ludii General Game System". www.ludii.games. Archived from the original on 2021-10-27. Retrieved 2021-10-27.
  46. ^ "Digital Ludeme Project | Modelling the Evolution of Traditional Games". www.ludeme.eu. Archived from the original on 2021-10-02. Retrieved 2021-10-27.
  47. ^ Piette, E.; Soemers, D. J. N. J.; Stephenson, M.; Sironi, C.; Stephenson, M.; Winands M. H. M.; Browne, C. (2020). "Ludii – The Ludemic General Game System" (PDF). European Conference on Artificial Intelligence (ECAI 2020), Santiago de Compestela. Archived (PDF) from the original on 2022-01-21. Retrieved 2021-10-27.
  48. ^ Świechowski, Maciej; Park, Hyunsoo; Mańdziuk, Jacek; Kim, Kyung-Joong (2015). "Recent Advances in General Game Playing". The Scientific World Journal. 2015 986262. Hindawi Publishing Corporation. doi:10.1155/2015/986262. PMC 4561326. PMID 26380375.
  49. ^ Crist, Walter; Piette, Éric; Jeneson, Karen; Soemers, Dennis J. N. J.; Stephenson, Matthew; Goor, Luk van; Browne, Cameron (February 11, 2026). "Ludus Coriovalli: using artificial intelligence-driven simulations to identify rules for an ancient board game". Antiquity. 100 (409): 111–126. doi:10.15184/aqy.2025.10264. ISSN 0003-598X.
  50. ^ Anderson, Sonja. "This Ancient Roman Game Board Was a Mystery. Researchers Used A.I. to Figure Out How to Play". Smithsonian Magazine. Retrieved 2026-03-03.
  51. ^ a b "Monte-Carlo Tree Search for General Game Playing". ResearchGate. Retrieved 2016-04-01.
  52. ^ Finnsson, Hilmar (2012). "Generalized Monte-Carlo Tree Search Extensions for General Game Playing". Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence. Archived from the original on 2013-10-15. Retrieved 2016-04-09.
  53. ^ Frydenberg, Frederik; Anderson, Kasper R.; Risi, Sebastian; Togelius, Julian. "Investigating MCTS Modifications in General Video Game Playing" (PDF). Archived (PDF) from the original on 2016-04-12. Retrieved 2016-04-09.
  54. ^ M. Swiechowski; J. Mandziuk; Y. S. Ong, "Specialization of a UCT-based General Game Playing Program to Single-Player Games," in IEEE Transactions on Computational Intelligence and AI in Games, vol.PP, no.99, pp.1-1 doi:10.1109/TCIAIG.2015.2391232
  55. ^ "Changing the root node from a previous game step". Archived from the original on 2021-01-17. DBS: A Directed Breadth First Search (DBS) algorithm
  56. ^ a b Perez, Diego; Dieskau, Jens; Hünermund, Martin. "Open Loop Search for General Video Game Playing" (PDF). Archived (PDF) from the original on 2016-03-28. Retrieved 2016-04-09.
  57. ^ Jesper Juul. Half-Real: Video Games Between Real Rules and Fictional Worlds. MIT Press, 2005.
  58. ^ a b Levine, John; Congdon, Clare Bates; Ebner, Marc; Kendall, Graham; Lucas, Simon M.; Miikkulainen Risto, Schaul; Tom, Thompson; Tommy. "General Video Game Playing" (PDF). Archived (PDF) from the original on 2016-04-18. Retrieved 2016-04-09.
  59. ^ a b Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Graves, Alex; Antonoglou, Ioannis; Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning" (PDF). Neural Information Processing Systems Workshop 2013. Archived (PDF) from the original on 12 September 2014. Retrieved 25 April 2015.
  60. ^ a b Schaul, Tom (August 2013). "A video game description language for model-based or interactive learning". 2013 IEEE Conference on Computational Inteligence in Games (CIG). pp. 1–8. CiteSeerX 10.1.1.360.2263. doi:10.1109/CIG.2013.6633610. ISBN 978-1-4673-5311-3. S2CID 812565.
  61. ^ a b Levine, John; Congdon, Clare Bates; Ebner, Marc; Kendall, Graham; Lucas, Simon M.; Miikkulainen, Risto; Schaul, Tom; Thompson, Tommy (2013). "General Video Game Playing". Artificial and Computational Intelligence in Games. 6. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik: 77–83. Archived from the original on 9 April 2016. Retrieved 25 April 2015.
  62. ^ Bowling, M.; Veness, J.; Naddaf, Y.; Bellemare, M. G. (2013-06-14). "The Arcade Learning Environment: An Evaluation Platform for General Agents". Journal of Artificial Intelligence Research. 47: 253–279. arXiv:1207.4708. doi:10.1613/jair.3912. ISSN 1076-9757. S2CID 1552061.
  63. ^ Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Hassabis, Demis; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K.; Stig Petersen, Georg Ostrovski; Beattie, Charles; Sadik, Amir; Antonoglou, Ioannis; King, Helen; Kumaran, Dharshan; Wierstra, Daan; Legg, Shane (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10.1038/nature14236. PMID 25719670. S2CID 205242740.
  64. ^ Korjus, Kristjan; Kuzovkin, Ilya; Tampuu, Ardi; Pungas, Taivo (2014). "Replicating the Paper "Playing Atari with Deep Reinforcement Learning"" (PDF). University of Tartu. Archived (PDF) from the original on 18 December 2014. Retrieved 25 April 2015.
  65. ^ Guo, Xiaoxiao; Singh, Satinder; Lee, Honglak; Lewis, Richard L.; Wang, Xiaoshi (2014). "Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning" (PDF). NIPS Proceedingsβ. Conference on Neural Information Processing Systems. Archived (PDF) from the original on 17 November 2015. Retrieved 25 April 2015.
  66. ^ Murphy, Tom (2013). "The First Level of Super Mario Bros. is Easy with Lexicographic Orderings and Time Travel ... after that it gets a little tricky." (PDF). SIGBOVIK. Archived (PDF) from the original on 26 April 2013. Retrieved 25 April 2015.
  67. ^ Murphy, Tom. "learnfun & playfun: A general technique for automating NES games". Archived from the original on 19 April 2015. Retrieved 25 April 2015.
  68. ^ Teller, Swizec (October 28, 2013). "Week 2: Level 1 of Super Mario Bros. is easy with lexicographic orderings and". A geek with a hat. Archived from the original on 30 April 2015. Retrieved 25 April 2015.
  69. ^ Nielsen, Thorbjørn S.; Barros, Gabriella A. B.; Togelius, Julian; Nelson, Mark J. "Towards generating arcade game rules with VGDL" (PDF). Archived (PDF) from the original on 2015-09-12. Retrieved 2018-02-24.
  70. ^ Schaul, Tom (7 February 2020). "schaul/py-vgdl". GitHub. Archived from the original on 11 June 2018. Retrieved 9 February 2020.
  71. ^ Neufeld, Xenija; Mostaghim, Sanaz; Perez-Liebana, Diego. "Procedural Level Generation with Answer Set Programming for General Video Game Playing" (PDF). Archived (PDF) from the original on 2016-03-28. Retrieved 2018-02-24.
[edit]