{"id":379,"date":"2023-12-01T09:03:31","date_gmt":"2023-12-01T09:03:31","guid":{"rendered":"https:\/\/kie.ue.poznan.pl\/en\/?p=379"},"modified":"2023-12-27T17:24:00","modified_gmt":"2023-12-27T17:24:00","slug":"from-science-to-practice-identifying-important-sources-of-information-on-wikipedia","status":"publish","type":"post","link":"https:\/\/kie.ue.poznan.pl\/en\/news\/from-science-to-practice-identifying-important-sources-of-information-on-wikipedia\/","title":{"rendered":"From science to practice: identifying important sources of information on Wikipedia"},"content":{"rendered":"<p>Wikipedia, being a widely available source of information in the digital era, attaches great importance to the verifiability of its content, which is fundamental to its credibility and trust. The platform&#8217;s verifiability rules require that all information, especially controversial or controversial information, be supported by credible, published sources. This ensures that the content in Wikipedia articles is not based on personal opinion or original research. However, the subjective nature of the concept of credibility and the dependence of the assessment on many factors (including language version or topic) may create a certain problem for users editing Wikipedia in terms of selecting appropriate sources of information.<!--more--><\/p>\n<p>With the huge number of websites (currently over a billion), individually assessing the credibility of each source becomes a challenge for Wikipedia users. Although there are detailed guidelines in various language versions of Wikipedia that define what reliable sources are, there is no comprehensive list of websites or other sources of information that can be considered reliable in the context of the various topics covered on Wikipedia. Additionally, the credibility and reputation of websites may change over time, and evaluation criteria may vary depending on the language version of Wikipedia or the topic area, which requires regular updates of such lists. For this reason, a comprehensive and constantly updated list of reliable sources would be very helpful not only to Wikipedia editors, but also to its readers who are looking for accurate and reliable information.<\/p>\n<p>Based on the analysis of over 60 million articles on Wikipedia, it is possible to extract information about over 330 million references (footnotes with information sources). This allowed the identification of the best information sources of Wikipedia using different assessment models. The table below shows the results of references extraction for selected language versions and the number of unique websites in October 2023:<\/p>\n<style>table.wikistat tr th{text-align:center !important;} .vright {text-align: right !important;}<\/style>\n<table class=\"wikistat\">\n<tbody>\n<tr>\n<th style=\"width: 10%;\">Wiki<\/th>\n<th style=\"width: 20%;\">Language Version<\/th>\n<th style=\"width: 25%;\">Number of Articles<\/th>\n<th style=\"width: 25%;\">Number of References<\/th>\n<th style=\"width: 20%;\">Unique Websites<\/th>\n<\/tr>\n<tr>\n<td>ar<\/td>\n<td>Arabic<\/td>\n<td class=\"vright\">1,219,168<\/td>\n<td class=\"vright\">6,355,164<\/td>\n<td class=\"vright\">294,089<\/td>\n<\/tr>\n<tr>\n<td>ca<\/td>\n<td>Catalan<\/td>\n<td class=\"vright\">735,551<\/td>\n<td class=\"vright\">3,895,389<\/td>\n<td class=\"vright\">197,470<\/td>\n<\/tr>\n<tr>\n<td>cs<\/td>\n<td>Czech<\/td>\n<td class=\"vright\">532,602<\/td>\n<td class=\"vright\">2,752,877<\/td>\n<td class=\"vright\">119,313<\/td>\n<\/tr>\n<tr>\n<td>de<\/td>\n<td>German<\/td>\n<td class=\"vright\">2,839,878<\/td>\n<td class=\"vright\">14,473,501<\/td>\n<td class=\"vright\">622,551<\/td>\n<\/tr>\n<tr>\n<td>en<\/td>\n<td>English<\/td>\n<td class=\"vright\">6,722,214<\/td>\n<td class=\"vright\">79,687,819<\/td>\n<td class=\"vright\">1,942,579<\/td>\n<\/tr>\n<tr>\n<td>es<\/td>\n<td>Spanish<\/td>\n<td class=\"vright\">1,833,749<\/td>\n<td class=\"vright\">12,558,623<\/td>\n<td class=\"vright\">509,313<\/td>\n<\/tr>\n<tr>\n<td>fa<\/td>\n<td>Persian<\/td>\n<td class=\"vright\">975,931<\/td>\n<td class=\"vright\">2,477,763<\/td>\n<td class=\"vright\">133,634<\/td>\n<\/tr>\n<tr>\n<td>fi<\/td>\n<td>Finnish<\/td>\n<td class=\"vright\">559,931<\/td>\n<td class=\"vright\">3,371,084<\/td>\n<td class=\"vright\">138,320<\/td>\n<\/tr>\n<tr>\n<td>fr<\/td>\n<td>French<\/td>\n<td class=\"vright\">2,557,559<\/td>\n<td class=\"vright\">19,455,752<\/td>\n<td class=\"vright\">576,523<\/td>\n<\/tr>\n<tr>\n<td>he<\/td>\n<td>Hebrew<\/td>\n<td class=\"vright\">342,285<\/td>\n<td class=\"vright\">1,867,068<\/td>\n<td class=\"vright\">103,848<\/td>\n<\/tr>\n<tr>\n<td>hi<\/td>\n<td>Hindi<\/td>\n<td class=\"vright\">162,954<\/td>\n<td class=\"vright\">496,057<\/td>\n<td class=\"vright\">47,617<\/td>\n<\/tr>\n<tr>\n<td>hu<\/td>\n<td>Hungarian<\/td>\n<td class=\"vright\">530,977<\/td>\n<td class=\"vright\">2,545,152<\/td>\n<td class=\"vright\">124,536<\/td>\n<\/tr>\n<tr>\n<td>id<\/td>\n<td>Indonesian<\/td>\n<td class=\"vright\">661,844<\/td>\n<td class=\"vright\">2,672,604<\/td>\n<td class=\"vright\">162,924<\/td>\n<\/tr>\n<tr>\n<td>it<\/td>\n<td>Italian<\/td>\n<td class=\"vright\">1,829,095<\/td>\n<td class=\"vright\">8,856,574<\/td>\n<td class=\"vright\">278,232<\/td>\n<\/tr>\n<tr>\n<td>ja<\/td>\n<td>Japanese<\/td>\n<td class=\"vright\">1,388,532<\/td>\n<td class=\"vright\">14,684,917<\/td>\n<td class=\"vright\">359,446<\/td>\n<\/tr>\n<tr>\n<td>ko<\/td>\n<td>Korean<\/td>\n<td class=\"vright\">646,717<\/td>\n<td class=\"vright\">1,885,878<\/td>\n<td class=\"vright\">91,918<\/td>\n<\/tr>\n<tr>\n<td>nl<\/td>\n<td>Dutch<\/td>\n<td class=\"vright\">2,133,536<\/td>\n<td class=\"vright\">3,010,002<\/td>\n<td class=\"vright\">112,318<\/td>\n<\/tr>\n<tr>\n<td>no<\/td>\n<td>Norwegian<\/td>\n<td class=\"vright\">616,624<\/td>\n<td class=\"vright\">2,102,507<\/td>\n<td class=\"vright\">107,343<\/td>\n<\/tr>\n<tr>\n<td>pl<\/td>\n<td>Polish<\/td>\n<td class=\"vright\">1,583,919<\/td>\n<td class=\"vright\">8,847,928<\/td>\n<td class=\"vright\">242,835<\/td>\n<\/tr>\n<tr>\n<td>pt<\/td>\n<td>Portuguese<\/td>\n<td class=\"vright\">1,110,209<\/td>\n<td class=\"vright\">7,692,600<\/td>\n<td class=\"vright\">319,534<\/td>\n<\/tr>\n<tr>\n<td>ru<\/td>\n<td>Russian<\/td>\n<td class=\"vright\">1,940,113<\/td>\n<td class=\"vright\">15,461,960<\/td>\n<td class=\"vright\">454,351<\/td>\n<\/tr>\n<tr>\n<td>sv<\/td>\n<td>Swedish<\/td>\n<td class=\"vright\">2,572,575<\/td>\n<td class=\"vright\">11,791,609<\/td>\n<td class=\"vright\">134,081<\/td>\n<\/tr>\n<tr>\n<td>th<\/td>\n<td>Thai<\/td>\n<td class=\"vright\">158,905<\/td>\n<td class=\"vright\">1,010,438<\/td>\n<td class=\"vright\">70,395<\/td>\n<\/tr>\n<tr>\n<td>tr<\/td>\n<td>Turkish<\/td>\n<td class=\"vright\">533,201<\/td>\n<td class=\"vright\">2,773,455<\/td>\n<td class=\"vright\">146,854<\/td>\n<\/tr>\n<tr>\n<td>uk<\/td>\n<td>Ukrainian<\/td>\n<td class=\"vright\">1,289,727<\/td>\n<td class=\"vright\">5,455,954<\/td>\n<td class=\"vright\">217,787<\/td>\n<\/tr>\n<tr>\n<td>vi<\/td>\n<td>Vietnamese<\/td>\n<td class=\"vright\">1,288,093<\/td>\n<td class=\"vright\">3,796,577<\/td>\n<td class=\"vright\">147,041<\/td>\n<\/tr>\n<tr>\n<td>zh<\/td>\n<td>Chinese<\/td>\n<td class=\"vright\">1,379,496<\/td>\n<td class=\"vright\">8,130,187<\/td>\n<td class=\"vright\">283,516<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>During the webinar, <a href=\"https:\/\/kie.ue.poznan.pl\/en\/wlodzimierz-lewoniewski\/\">Dr. W\u0142odzimierz Lewoniewski<\/a> presented the possibilities of identifying and automatically assessing the importance of information sources of Wikipedia articles from different language versions. As part of the practical part, some of the capabilities of the <a href=\"https:\/\/bestref.net\">BestRef<\/a> tool were shown, which contains information about the results of the evaluation of millions of Internet sources in Wikipedia articles from the point of view of individual language versions.<\/p>\n<div style=\"text-align:center\">Webinar recording:<br \/><iframe loading=\"lazy\" src=\"https:\/\/www.facebook.com\/plugins\/video.php?height=314&#038;href=https%3A%2F%2Fwww.facebook.com%2FWikimediaPolska%2Fvideos%2F837514828116233%2F&#038;show_text=false&#038;width=560&#038;t=0\" width=\"560\" height=\"314\" style=\"border:none;overflow:hidden\" scrolling=\"no\" frameborder=\"0\" allowfullscreen=\"true\" allow=\"autoplay; clipboard-write; encrypted-media; picture-in-picture; web-share\" allowFullScreen=\"true\"><\/iframe><\/div>\n<p>The webinar took place on November 23, 2023. The organizer of the event is the <a href=\"https:\/\/wikimedia.pl\/\" rel=\"noopener noreferrer\" target=\"_blank\">Wikimedia Polska<\/a>, which supports and promotes Wikipedia and its sister projects (such as Wikidata, Wiktionary, Wikinews, Wikisource and others).<\/p>\n<p>More information about research on the analysis of information sources on Wikipedia can be found in scientific publications:<\/p>\n<ul>\n<li><a href=\"https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-29570-6_3\" target=\"_blank\" rel=\"noopener noreferrer\">Companies in Multilingual Wikipedia: Articles Quality and Important Sources of Information<\/a> (2023)<\/li>\n<li><a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S1877050922012777\" target=\"_blank\" rel=\"noopener noreferrer\">Identification of Important Web Sources of Information on Wikipedia across various Topics and Languages<\/a> (2022)<\/li>\n<li><a href=\"https:\/\/arxiv.org\/abs\/2204.14130\" target=\"_blank\" rel=\"noopener noreferrer\"> Reliability in Time: Evaluating the Web Sources of Information on COVID-19 in Wikipedia across Various Language Editions from the Beginning of the Pandemic<\/a> (2022)<\/li>\n<li><a href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/9908858\" target=\"_blank\" rel=\"noopener noreferrer\">Identifying Reliable Sources of Information about Companies in Multilingual Wikipedia<\/a> (2022)<\/li>\n<li><a href=\"https:\/\/www.mdpi.com\/2078-2489\/11\/5\/263\" target=\"_blank\" rel=\"noopener noreferrer\">Modeling Popularity and Reliability of Sources in Multilingual Wikipedia<\/a> (2020)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Wikipedia, being a widely available source of information in the digital era, attaches great importance to the verifiability of its content, which is fundamental to its credibility and trust. The platform&#8217;s verifiability rules require that all information, especially controversial or controversial information, be supported by credible, published sources. This ensures that the content in Wikipedia articles is not based on personal opinion or original research. However, the subjective nature of <a href=\"https:\/\/kie.ue.poznan.pl\/en\/news\/from-science-to-practice-identifying-important-sources-of-information-on-wikipedia\/\" class=\"read-more\">&#8230; Read More<\/a><\/p>\n","protected":false},"author":1,"featured_media":380,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[47,22,150,15,13],"class_list":["post-379","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-information-quality","tag-reliability","tag-webinar","tag-wikipedia","tag-wlodzimierz-lewoniewski"],"_links":{"self":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts\/379","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/comments?post=379"}],"version-history":[{"count":0,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts\/379\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/media\/380"}],"wp:attachment":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/media?parent=379"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/categories?post=379"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/tags?post=379"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}