{"id":542,"date":"2025-01-03T21:04:41","date_gmt":"2025-01-03T21:04:41","guid":{"rendered":"https:\/\/kie.ue.poznan.pl\/en\/?p=542"},"modified":"2025-01-09T21:19:28","modified_gmt":"2025-01-09T21:19:28","slug":"workshop-on-web-content-exploration","status":"publish","type":"post","link":"https:\/\/kie.ue.poznan.pl\/en\/news\/workshop-on-web-content-exploration\/","title":{"rendered":"Workshop on web content exploration"},"content":{"rendered":"<p>Workshop for students focusing on the exploration and processing of online content were held at the Pozna\u0144 University of Economics and Business. The session was led by <strong>Mateusz Kuczy\u0144ski<\/strong>, a student at our university who combines his master\u2019s studies in Informatics and Econometrics with work in the field of data exploration.<!--more--><\/p>\n<p>During the meeting, both theoretical and practical fundamentals necessary to start independent data acquisition from websites were discussed. The topics covered included downloading and processing data in HTML format, as well as best practices and potential challenges. Students learned how to make use of built-in analytical tools that enable monitoring the HTML structure, CSS styles, and network requests, to effectively identify elements for further processing. Methods of efficiently parsing websites, retrieving their content, and saving the obtained information in formats suitable for further data analysis \u2014 using Python libraries such as <em>bs4 (BeautifulSoup)<\/em>, <em>requests<\/em>, and <em>pandas<\/em> \u2014 were also presented. Participants had the opportunity to observe each implementation step in real time, ask questions, and discuss potential issues related to data selection or technical constraints.<\/p>\n<p>The workshop took place on December 19, 2024, and were organized by <a href=\"https:\/\/kie.ue.poznan.pl\/en\/srg-data-science\/\">SRG &#8220;Data Science&#8221;<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Workshop for students focusing on the exploration and processing of online content were held at the Pozna\u0144 University of Economics and Business. The session was led by Mateusz Kuczy\u0144ski, a student at our university who combines his master\u2019s studies in Informatics and Econometrics with work in the field of data exploration.<\/p>\n","protected":false},"author":1,"featured_media":543,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[234,172,34],"class_list":["post-542","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-mateusz-kuczynski","tag-python","tag-srg-data-science"],"_links":{"self":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts\/542","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/comments?post=542"}],"version-history":[{"count":0,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/posts\/542\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/media\/543"}],"wp:attachment":[{"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/media?parent=542"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/categories?post=542"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kie.ue.poznan.pl\/en\/wp-json\/wp\/v2\/tags?post=542"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}