Abstract Wikipedia

Abstract Wikipedia
Developer(s)Wikimedia Foundation
Websitemeta.wikimedia.org/wiki/Abstract_Wikipedia Edit this on Wikidata

Abstract Wikipedia is an in-development project of the Wikimedia Foundation. It aims to use Wikifunctions to create a language-independent version of Wikipedia using its structured data.[1] First conceived in 2020 (with a precursor proposal in 2013), Abstract Wikipedia has been under active development ever since, with the related project of Wikifunctions launched successfully in 2023. Nevertheless, the project has proved controversial. As envisioned, Abstract Wikipedia would consist of "Constructors" (templates for abstract statements), "Content" (the abstract statements themselves), and "Renderers" (which would automatically translate abstract statements into natural language).

History[edit]

Conception (2013–2020)[edit]

On 7 August 2013, Denny Vrandečić, the co-founder of Wikidata, suggested "an extension of the template system" where template calls would expand into content based on the language of the user.[2] For example, a template call such as {{F12:Q64|Q5519|Q183}} could be variously expanded by Template:F12/en into "Berlin is the capital of Germany.", and by Template:F12/de into "Berlin ist die Hauptstadt Deutschlands."[2] This has been viewed as a predecessor of Abstract Wikipedia proper.[3]

Vrandečić proposed it again in a Google working paper in April 2020,[4] formally proposed in May 2020 (as Wikilambda). It was approved by the Wikimedia Foundation Board of Trustees in July 2020 as Abstract Wikipedia.[5][6][7]

Development (2020–present)[edit]

In April 2021, Vrandečić published an overview of the system in the computer science journal Communications of the ACM.[8]

The Abstract Wikipedia team at a 2022 offsite in Switzerland. Left hand side on the table, from front to back: Ariel Gutman, Ori Livneh, Maria Keet, Sandy Woodruff, Mary Yang, Eunice Moon. At head of table: Rebecca Wambua. Right hand side of the table, from front to back: Olivia Zhang, Denny Vrandečić, Edmund Wright, Dani de Waal, Ali Assaf, James Forrester.
The Abstract Wikipedia team at a 2022 offsite in Switzerland

In January 2023, The Signpost reported on the slow progress of the Abstract Wikipedia project.[9] According to an evaluation by four Google Fellows working on the project, it was at a "substantial risk of failure" due to its poor technical plan.[9] The Google Fellows recommended that Abstract Wikipedia be decoupled from Wikifunctions, that Wikifunctions refine MediaWiki's support for programming in Lua rather than having a completely new language, and that Abstract Wikipedia converge on a unified approach to natural language generation (NLG) that builds on open source software if possible.[9]

The Wikimedia Foundation staff responded to this report by completely rejecting the idea that Abstract Wikipedia and Wikifunctions could be separated, and accusing the Google Fellows of making "fallacies and false comparisons".[9] The Wikimedia Foundation also stated that using existing NLG pipelines like Grammatical Framework could not support certain languages such as the Niger–Congo B languages, and would also "replicate the trends of an imperialist English-focused Western-thinking industry.".[9]

On 26 July 2023, Wikifunctions officially launched to the general public.[10]

Design[edit]

Technical components[edit]

A diagram of the Abstract Wikipedia technical plan. At the bottom layer is "Constructors: single set", which are illustrated with a cylinder labeled "Wikidata: lexical and ontological knowledge". In the middle layer are "Content: one per article", which are illustrated with boxes labeled "San Francisco", "Oxygen", "Marie Curie", and "Music". At the top layer is "Renderers: one per language", which are illustrated with "English", "German", and "Chinese" boxes.
A diagram of the Abstract Wikipedia technical plan

The Abstract Wikipedia project would consist of three main components:[11]

  1. Constructors, which enable abstract statements. The Abstract Wikipedia team prefers that these are hosted in Wikifunctions.
  2. Content, which consists of abstract calls to Constructors, with values for each slot. These are preferably hosted in Wikidata.
  3. Renderers (one per language), which convert the abstract Content into text in that particular language. These are, like Constructors, also preferably hosted in Wikifunctions.

Each version of Wikipedia, once Abstract Wikipedia is deployed, could choose between three options:[12]

  1. Implicit integration with Abstract Wikipedia. There would be a special page called Special:Abstract that would display content automatically translated from Abstract Wikipedia into the local language. This content would be linkable and searchable. Furthermore, a new magic word LINK_TO_Q would be added in order to enable linking to Abstract Wikipedia content.
  2. Explicit integration with Abstract Wikipedia. In this scenario, to create a new article, the editor would add a sitelink on Wikidata to a not-yet-existing page. This would create a "virtual article" in mainspace that would be pre-populated with content from Abstract Wikipedia automatically translated into the local language. This "virtual article" would have a URL similar to that of a real article, and would also be linkable and searchable just like a real article.
  3. No integration with Abstract Wikipedia.

Example[edit]

As a preliminary example, content from Abstract Wikipedia could look like:[13]

Article(   content: [     Instantiation(       instance: San Francisco (Q62),       class: Object_with_modifier_and_of(         object: center,         modifier: And_modifier(           conjuncts: [cultural, commercial, financial]         ),         of: Northern California (Q1066807)       )     ),     Ranking(       subject: San Francisco (Q62),       rank: 4,       object: City (Q515),       by: Population size (Q1613416),       local_constraint: California (Q99),       after: [Los Angeles (Q65), San Diego (Q16552), San Jose (Q16553)]     )   ] ) 

This would translate into English as "San Francisco is the cultural, commercial, and financial center of Northern California. It is the fourth-most populous city in California, after Los Angeles, San Diego and San Jose."

References[edit]

  1. ^ Hill, Paul (13 April 2020). "Wikidata founder floats idea for balanced multilingual Wikipedia". Neowin. Retrieved 2 July 2020.
  2. ^ a b Vrandečić, Denny (7 August 2013). "A proposal towards a multilingual Wikipedia - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  3. ^ "Abstract Wikipedia/Historic proposal - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  4. ^ Vrandečić, Denny (8 April 2020). "Architecture for a multilingual Wikipedia". arXiv:2004.04733 [cs.CY].
  5. ^ Maher, Katherine. "Abstract Wikipedia/June 2020 announcement - Meta". meta.wikimedia.org.
  6. ^ ""Abstract Wikipedia": Neues Projekt soll Wissen in alle Sprachen übersetzen". RedaktionsNetzwerk Deutschland (in German). 6 July 2020. Retrieved 6 July 2020.
  7. ^ Rixecker, Kim (6 July 2020). "Abstract Wikipedia: Wie das Online-Lexikon eines seiner größten Probleme lösen will". t3n Magazine (in German). Retrieved 6 July 2020.
  8. ^ Vrandečić, Denny (April 2021). "Building a Multilingual Wikipedia". Communications of the ACM. 64 (4): 38–41. doi:10.1145/3425778. ISSN 0001-0782. Wikidata Q106143058.
  9. ^ a b c d e Bayer, Tilman (1 January 2023). "Wikimedia Foundation's Abstract Wikipedia project "at substantial risk of failure"". The Signpost. Retrieved 14 August 2023.
  10. ^ Vrandečic, Denny (26 July 2023). "Abstract Wikipedia/Updates/2023-07-26 - Meta". meta.wikimedia.org. Retrieved 22 September 2023.
  11. ^ "Abstract Wikipedia/Architecture - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  12. ^ "Abstract Wikipedia/Components - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  13. ^ "Abstract Wikipedia/Examples - Meta". meta.wikimedia.org. Retrieved 14 August 2023.

External links[edit]