The Estonian Open Data Forum – Celebrating Progress and Recognizing Achievements

This October, I had the distinct honor of participating in Estonia’s premier open data event, the Open Data Forum (Avaandmete foorum), organized by the Ministry of Economic Affairs and Communications of Estonia, where I was invited to talk about the role of academia and the private sector in the open government data landscape. This annual gathering brings together industry experts, academic researchers, and government leaders to discuss key trends, achievements, and the future of open data in Estonia. It also highlights contributions from universities by awarding the best dissertations written by Estonian students. The latter made this event very special for me, as one of my students, Kevin Kliimask, received an award for his outstanding bachelor’s thesis 🏆 🥇 🏅!

In his thesis, “Automated Tagging of Datasets to Improve Data Findability on Open Government Data Portals,” Kevin developed an LLM-powered interface that automates dataset tagging in both English and Estonian. It supports data publishers in preparing metadata and makes data easier for users to find on portals, since practice shows that tags are often missing. For example, our analysis of the Estonian Open Data Portal revealed that 11% of datasets have no associated tags, while 26% have only one tag. This underscores challenges in data findability and accessibility even within a portal that the recent Open Data Maturity Report considers a trend-setter. The solution was evaluated by users, and their feedback was used to define an agenda for future improvements to the prototype. The thesis has already been turned into a scientific paper 👉 TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals, presented at an IEEE international conference (I posted on this earlier 👉 here).
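For readers who want to run a similar check on their own portal, here is a minimal sketch of how the share of untagged and single-tagged datasets could be computed. This is not the code used in the thesis: the `tag_coverage` function, the `tags` field name, and the sample records are all hypothetical and only illustrate the idea.

```python
from collections import Counter


def tag_coverage(datasets):
    """Return the share of datasets with no tags and with exactly one tag.

    `datasets` is assumed to be a list of dicts with an optional "tags" list
    (a hypothetical structure, not the portal's actual metadata schema).
    """
    total = len(datasets)
    if total == 0:
        return {"no_tags": 0.0, "one_tag": 0.0}
    # Count how many datasets have 0, 1, 2, ... tags; a missing or empty
    # "tags" field is treated as zero tags.
    counts = Counter(len(d.get("tags") or []) for d in datasets)
    return {"no_tags": counts[0] / total, "one_tag": counts[1] / total}


# Hypothetical sample records, invented for illustration only.
sample = [
    {"title": "Bus stops", "tags": ["transport", "geodata"]},
    {"title": "Air quality", "tags": ["environment"]},
    {"title": "Budget 2023", "tags": []},
    {"title": "Population register extract"},
]

shares = tag_coverage(sample)
print(f"No tags: {shares['no_tags']:.0%}, one tag: {shares['one_tag']:.0%}")
```

On real portal metadata, the same two shares would correspond to figures like the 11% and 26% mentioned above, assuming the metadata export exposes tags per dataset in a comparable way.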

As for my talk, titled Unlocking the Power of Open Data: The Role of Academia and the Private Sector in Building Inclusive and Sustainable Open Data Ecosystems, I emphasized the need for a holistic approach to open data that goes beyond merely opening and publishing data. It requires a systemic view that treats an open data initiative as an Open Data Ecosystem (also confirmed by the Open Data Charter 👉here). After all, we deal not only with open data (availability), portals, stakeholders, and actors, but also with the processes surrounding them, emerging technologies, and different forms of intelligence. These go beyond just Artificial Intelligence, whose role is nevertheless crucial (see our paper on the eight-fold role of AI in OGD).

As such, I discussed the main idea of the talk: the role of academia & the private sector in the ODE, which in my view is at least four-fold, namely data consumers, data providers, contributors to ODE sustainability & myth busters on the global stage (assigning a “Made in Estonia” tag to OGD in addition to the one we have for e-government). I also expanded the general mantra of “Data for AI” to “data for AI, AI for data, data not only for AI and not only AI for data”.


A big thank you to the organisers, who gathered so many speakers (Cybernetica, FinEst Centre for Smart Cities, Riigi Infosüsteemi Amet // Estonian Information System Authority (NCSC-EE), University of Tartu and many others) to discuss the highlights of today & tomorrow for Estonian Open Data, and High-Value Datasets in particular, as they were the main focus of the Forum. I was happy to be part of these discussions!

The 25th Annual International Conference on Digital Government Research (DGO2024): a brief summary on presenter, track chair, panel organizer, and moderator roles

Last week, I had the pleasure of participating in the 25th Annual International Conference on Digital Government Research (DGO2024), organized by the Digital Government Society and hosted by National Taiwan University in the beautiful city of Taipei (Taiwan) under the theme “Internet of Beings: Transforming Public Governance”. The conference offered an exceptional venue, warm hospitality from the local committee led by Helen Liu and her team, a rich social program, and an outstanding scientific program. The event featured well-selected keynotes and panels from prominent organizations such as Foxconn, the International Cooperation Center of TCA, Massachusetts Institute of Technology, Taipei Urban Intelligence Center, and the Ministry of Digital Affairs. Key topics included AI, Smart City initiatives, and Data Governance, which facilitated extensive networking and brainstorming sessions.

I was honored to contribute to this vibrant dialogue in multiple roles: presenter, track chair, panel organizer, and moderator. Together with my students and colleagues, we presented four papers, each reflecting our collaborative research efforts:

  1. Towards a Privacy and Security-Aware Framework for Ethical AI (Daria Korobenko, Anastasija Nikiforova, Rajesh Sharma). The proposed (conceptual at the moment) Privacy and Security-Aware Framework for Ethical AI is centered around the Data, Technology, People, and Process dimensions. Each dimension is guided by a set of specific questions that cover the overarching themes of privacy and security within AI systems, while the framework itself follows a risk-based approach (similar to the EU AI Act). As such, it is designed to assist diverse stakeholders, including organizations, academic institutions, and governmental bodies, in both the development and critical assessment of AI systems.
  2. Exploring Estonia’s Open Government Data Development as a Journey towards Excellence: Unveiling the Progress of Local Governments in Open Data Provision (Katrin Rajamae-Soosaar and Anastasija Nikiforova). This paper explores the evolution of Estonia’s 🇪🇪 OGD development at both national & local levels through an analysis of indices, the Estonian OGD portal, and a literature review. Findings reveal national progress due to portal improvements and legislative changes, while local governments lag in OGD provision, highlighting the need for future research on municipal OGD barriers and enablers.
  3. An Integrated Usability Framework for Evaluating Open Government Data Portals: Comparative Analysis of EU and GCC Countries (Fillip Molodtsov and Anastasija Nikiforova). This paper develops a framework to evaluate OGD portal usability, considering user diversity, collaboration, and data exploration capabilities, and applies it to 33 national portals in the EU and GCC 🇪🇺🇸🇦🇶🇦🇧🇭🇦🇪, highlighting good practices and common shortcomings and emphasizing the competitiveness of GCC portals.
  4. Unlocking the Potential of Open Government Data: Exploring the Strategic, Technical, and Application Perspectives of High-Value Datasets Opening in Taiwan (Hsien-Lee Tseng and Anastasija Nikiforova). In short, data has unprecedented value. However, simply making data available in an open format creates little added value; what matters is the value of these data for the real needs of the end user. This is where the concept of high-value datasets (HVD) comes into play, which has become popular in recent years (predominantly in connection with the OD Directive by the European Commission). Defining and opening HVD, in turn, is a complex process consisting of a set of interrelated steps, the implementation of which may vary from one country or region to another. Therefore, there has recently been a call to conduct research in a country or region setting considered to be of greatest national value. So far, only a few studies have been conducted, most of which consider only one step of the process, such as identifying HVD or measuring their impact. With this study, we explore the entire lifecycle of HVD opening in the case of one of the world’s leading producers of ICT products, Taiwan. To do this, we conduct a qualitative study based on exploratory interviews with representatives of the government agencies in Taiwan responsible for HVD opening, namely the Ministry of Digital Affairs, the Ministry of the Interior, the Ministry of Transportation and Communications, and the Ministry of Environment. As part of these interviews, we examine strategic aspects associated with HVD determination, technical aspects related to the dataset preparation stage (incl. data quality, granularity, update frequency, integration methods, or data evaluation), and application aspects related to the further assessment of the impact generated by HVD, identifying some good practices and weaknesses to be further examined and fixed.

I also chaired the track “Sustainable Public and Open Data Ecosystems,” which we launched this year with colleagues and on which I posted before. Although this is a brand-new track, we received a good number of contributions, as the topic appeared to be very timely. We hope to see it continue, serving as a stage for the Digital Government Society’s dialogue around the public and open data ecosystem in and for our digital future. At least this session has demonstrated the interest in such an environment, so many thanks to all who actively participated in this discussion. BTW, should you be interested in the difference between public and open data ecosystems, I encourage you to read our conceptualization and typology in our “Understanding the development of public data ecosystems: from a conceptual model to a six-generation model of the evolution of public data ecosystems” paper. We are also optimistic that the best contributions from this track will soon be available in a special section of the Information Polity Journal that we have recently launched.

In addition, together with Hsien-Lee Tseng, we organized the panel “Sociotechnical Transformation in the Decade of Healthy Ageing to Empower the Silver Economy: Bridging the Silver Divide through Social and Digital Inclusion,” which addressed crucial issues related to the integration of aging populations into the digital economy and society. Our discussions focused on case studies from Taiwan and Estonia, two regions with significant aging populations and leaders in ICT and digital government. We explored several innovative initiatives:

  1. The Aged Dwelling Plan by the Ministry of Interior of Taiwan, which proactively delivers resources to those most in need through the Senior Living Needs Index Framework. It integrates cross-agency data such as household registration, building information, long-term care, low-income households, and open geospatial data.
  2. The Digital Silver Hub, an ecosystem that fosters innovative solutions for the silver population, involving the public sector, private sector, academia, and end-users. It utilizes a collective intelligence model to address the challenges faced by older adults.
  3. Health Promotion, Technology Inclusion by National Taitung University. Aimed at achieving technological inclusion, this project focuses on non-discriminatory health promotion technology policies and activities for people with chronic diseases.

As such, our discussions highlighted the opportunities and challenges in supporting the Decade of Healthy Ageing, an initiative by the United Nations. Key themes included Data Management, Security, and Privacy; Digital Literacy and Regional Adoption; Human-Computer Interaction (HCI) and User-Centric Design; and Interoperability. Our panel concluded that there is no one-size-fits-all solution to the challenges faced by the aging population. Instead, it is crucial to recognize and leverage the capacities and strengths of each region to develop tailored solutions, whether they be social, technical, or sociotechnical. By doing so, we can create effective and sustainable strategies to support healthy aging and bridge the silver divide.

The conference also featured a working meeting on the new Digital Government Society Chapter, “Artificial Intelligence & Government.” I contributed to the discussions and look forward to continued involvement and impact in this ambitious initiative led by Fadi Salem.

In summary, DGO2024 was an incredibly insightful and productive week.

Generative AI Role in Shaping the Future of Open Data Ecosystems: Synergies amidst Paradoxes

The role of Generative AI is a subject of debate in almost every domain today, and the open data (ecosystem) domain is no exception. Here’s my two cents on this in the blog post “Generative AI Role in Shaping the Future of Open Data Ecosystems: Synergies amidst Paradoxes”.
In this blog post, I present some personal observations and predictions on how Generative AI will stop the open “data winter” or even give an impetus to the “data spring” that has recently been called for. While the steps towards this may be many and varied, one obvious element that could affect the current state of affairs is Artificial Intelligence, particularly in the form of Generative AI. Along with this “forecast” and high-level discussion, which is expected to become more in-depth and likely evidence-based (since, together with my colleagues and students, we are already working in this direction), I mention some paradoxes in this symbiotic relationship between Generative AI and open data (ecosystem)…

IFIP EGOV-CeDEM-EPART 2023 – retrospective on how it was? From Metaverse to wine tasting

It finally took place! EGOV2023 – IFIP EGOV-CeDEM-EPART – one of the most recognized conferences in e-Government, ICT and public administration and related topics (incl. Smart Cities, Sustainability, Innovation and many more), which lasted 3 days in the charming city of Budapest (Hungary), is over, and I am here to reflect on it (in just a few words). Although these were just 3 days, they were very busy and full of insights and activities, since every day I took on another role, i.e., day#1 – presenter of a paper, day#2 – workshop organizer, day#3 – chair of two out of three sessions of the “Emerging Issues and Innovations” track, which I co-chaired together with Marijn Janssen, Csaba Csaki and Francesco Mureddu. Not to forget, at this conference I am also a program committee member of the Open Data track.

Let me now provide a few insights on all these days, including my roles.

Let’s start with day#1… The conference was opened by Ida Lindgren and Csaba Csaki, our local host, who did a great job and organized a unique conference with an exceptionally rich social programme. Then a brilliant keynote talk was delivered by Professor Yogesh K Dwivedi (possibly the most impactful researcher in the area) on the Metaverse for Government and the associated Challenges, Opportunities, and Future Research Agenda, in which he pointed to a lack of studies on this topic. Luckily, our track “Emerging Issues and Innovations” had accepted one paper on the Metaverse in digital government, the only one at the conference. Unfortunately, the discussion did not happen due to the early departure of Yogesh and the late arrival of the authors. Almost immediately after the keynote came the session in which I delivered a talk on HVD determination, “Towards High-Value Datasets determination for data-driven development: a systematic literature review” (authors: Nikiforova, Rizun, Ciesielska, Alexopoulos, Miletić). Just to remind you, I posted on this paper before. This is the paper that has already been named a “signal in the noise”, in which we asked ourselves and the current body of knowledge (this is a systematic literature review-driven study):
❓How is the value of open government data perceived / defined? Are local efforts being made at the country level to identify datasets that provide the most value to stakeholders of the local open data ecosystem?
❓What datasets are considered to be of higher value in terms of data nature, data type, data format, data dynamism?
❓What indicators are used to determine HVD?
❓Is there a framework for determining country-specific HVD? I.e., is it possible to determine what datasets are of value and interest for their reuse & value creation, taking into account the specificities of the country, e.g., culture, geography, ethnicity, likelihood of crises and/or catastrophes?
Although neither OGD nor the importance of data value are new topics, scholarly publications dedicated to HVD are very limited, which makes the study unique and a call for action. This is probably also why it is recommended for reading not only by us but also by The Living Library (by New York University, NYU Tandon School of Engineering, govlab). All in all, we have established a knowledge base, incl. several definitions of HVD, data-related aspects, stakeholders, and some indicators and approaches. It can now be used as a basis for discussing what a framework for determining HVD should look like. Together with the input we received from open data experts in a series of international workshops as part of ICEGOV2022, ICOD2022 and DGO2023, it could enrich the common understanding of the goal, thereby contributing to the next open data wave.
👉Read the paper here
👉See slides here
👉Find supplementary data in open access at Zenodo here
Here I am very grateful to session attendees for raising a discussion around the topic, where some of those comments confirmed once more the correctness of both the problem statement and our future plans – thanks a lot!

Day#2 started with another keynote talk, this time delivered by Andras Koltay (President of the National Media and Infocommunications Authority and the Media Council of Hungary) on the protection of freedom of expression from social media platforms, a very different yet very insightful talk. Then my second role followed, that of workshop organizer and chair. As part of our workshop “PPPS’2023 – Proactive and Personalised Public Services: Searching for Meaningful Human Control in Algorithmic Government” (chairs: Anastasija Nikiforova, Nitesh Bharosa, Dirk Draheim, Kuldar Taveter), which took place in a hybrid mode (not an easy task), we initiated a discussion about proactive and personalised public services, i.e., we:
🎯talked about the concepts of public services, reactive and proactive models of public services, and models of their personalization;
🎯asked participants to share their views on public services and the levels of proactivity and personalisation of these services in their countries aiming to develop concepts for holistic proactive and personalised public service delivery;
🎯tried to establish a clearer vision of the “as-is” model and the necessary transition to the “to-be” model, their underlying factors, as well as pitfalls of which governments should be aware when designing, developing, and setting up proactive and personalised public services, trying to understand what are those emerging technologies that will likely have greater effect on public services in terms of both driving them or creating obstacles / barriers for their development and maintenance.
Read a bit more 👉 here
Special thanks to all participants, who attended and were very active (and survived)!

And now a few insights from day#3, when three sessions of our Emerging Issues and Innovations track (chairs: Marijn Janssen, Anastasija Nikiforova, Dr. Csaba Csaki, Francesco Mureddu) finally took place, where I was delighted to chair two of these sessions. Within these three sessions, 8 very diverse, but at the same time super interesting and insightful talks were delivered (predominantly from the United Nations University and Sweden), namely:
✍Metaverse vs. metacurse: The role of governments and public sector use cases by Charmaine Distor, Soumaya Ben Dhaou, & Morten Meyerhoff Nielsen, which can be seen as a continuation of the keynote talk by Prof. Yogesh Dwivedi delivered on the first day;
✍Dynamic Capabilities and Digital Transformation in Public sector: Evidence from Brazilian case study by Larissa Magalhães;
✍Affording and constraining digital transformation: The enactment of structural change in three Swedish government agencies by Malin Tinjan, Robert Åhlén, Susanna Hammelev Jörgensen & Johan Magnusson
✍The Vicious Cycle of Magical Thinking: How IT Governance Counteracts Digital Transformation by Susanna H. Jörgensen, Tomas Lindroth, Johan Magnusson, Malin Tinjan, Jacob Torell & Robert Åhlen
✍Buridan’s Ass: Encapsulation as a Possible Solution to the Prioritization Dilemma of Digital Transformation by Johan Magnusson, Per Persson, Jacob Torell & Ingo Paas
✍Measuring digital transformation at the local level: assessing the current state of Flemish municipalities by Lieselot Danneels & Sarah Van Impe
✍Blockchain and the GDPR – the shift needed to move forward by Inês Campos Ruas, Soumaya Ben Dhaou & Zoran Jordanoski
✍Construct Hunting in GovTech Research: An Exploratory Data Analysis by Mattias Svahn, Aron Larsson, Eloisa Macedo and Jorge Bandeira
Read papers 👉 here, here & here
Big thanks go to both authors and presenters, as well as the audience, who was very active (even despite the fact that it was the very last day of the conference) and made these sessions a success!
And right after these two sessions, the third keynote followed, delivered by Laszlo Trautmann: “The ethics of expertise – the political economy implications of AI”.


And last but not least, there was yet another social event: wine tasting at Etyeki Kúria Borászat / Winery, which was the perfect happy ending to EGOV2023!

Exceptional organization by Corvinus University of Budapest, Csaba Csaki and his team, International Federation for Information Processing (IFIP), Digital Government Society – cheers!🍷🍷🍷

Keynote at the 5th International Conference on Advanced Research Methods and Analytics (CARMA 2023)

On June 28, I had the honor of participating in the opening of CARMA2023 – 5th International Conference on Advanced Research Methods and Analytics “Internet and Big Data in Economics and Social Sciences”, delivering my keynote “Public data ecosystems in and for smart cities: how to make open / Big / smart / geo data ecosystems value-adding for SDG-compliant Smart Living and Society 5.0?” in the spectacular city of Sevilla, Spain 🇪🇸 🇪🇸 🇪🇸. What an honor to open the conference, immediately after the inaugural speech by the organizers and sponsors, including representatives of the Joint Research Center, European Commission (JRC), who even mentioned the topics I covered in my keynote (not limited to them, of course) as those that make this conference an event to attend and to learn from!!!

In this talk, as the title suggests, I:

  • elaborated on the concepts of public / open data (incl. OGD), smart city and SDG, and how they are related;
  • introduced the concept of Society 5.0 and how it is related to open data;
  • and finally, and more importantly, discussed the public / open data ecosystem: what it is and what it consists of.

I then dived into (1) data-related aspects of the public data ecosystem, i.e. what are the data-related prerequisites for a sustainable and resilient data ecosystem? (2) data portal / platforms as entry points and how to make it sufficiently attractive for the target audience? (3) stakeholder engagement – how to involve the target audience? what are the benefits of their involvement? and some more things.

The public data ecosystem part was built around our “Transparency of open data ecosystems in smart cities: Definition and assessment of the maturity of transparency in 22 smart cities”, with some references to other studies such as “Transparency-by-design: What is the role of open data portals?”, “Timeliness of Open Data in Open Government Data Portals Through Pandemic-related Data: A long data way from the publisher to the user”, and “Open government data portal usability: A user-centred usability analysis of 41 open government data portals”, which were previously noticed by the Open Data Institute and by the Living Library, which recommends studies it sees as the “signal in the noise”.

For the data, apart from the almost “classical things”, I referred to the topic of “high-value datasets” and dived into a taxonomy we presented in “Towards High-Value Datasets determination for data-driven development: a systematic literature review” (also recommended by the Living Library as the “signal in the noise”), enriched by the results of my earlier study “Towards enrichment of the open government data: a stakeholder-centered determination of High-Value Data sets for Latvia” as well as the results of two international workshops we organized.

The part on public / open data, smart city, SDG and Society 5.0 and how they are interrelated was, in turn, based on our chapter “The Role of Open Data in Transforming the Society to Society 5.0: A Resource or a Tool for SDG-Compliant Smart Living?”, which FIT Academy called “groundbreaking research”.

As for engagement, it was mostly about workshops, datathons, hackathons, and data competitions, as well as co-creation: how a co-creation ecosystem emerges, what the prerequisites for it are, etc., incl. references to “Open data hackathon as a tool for increased engagement of Generation Z: to hack or not to hack?” and “The Role of Open Government Data and Co-creation in Crisis Management: Initial Conceptual Propositions from the COVID-19 Pandemic”.

CARMA is a forum for researchers and practitioners to exchange ideas and advances on how emerging research methods and sources are applied to different fields of social sciences, as well as to discuss current and future challenges. Its main focus is on four groups of topics. The first is Internet and Big Data sources in economics and social sciences, including Social media and public opinion mining, Web scraping, Google Trends and Search Engine data, Geospatial and mobile phone data, and Open data and public data. The second is Big Data methods in economics and social sciences, such as Sentiment analysis, Internet econometrics, AI and Machine learning applications, Statistical learning, Information quality and assessment, Crowdsourcing, Natural Language processing, and Explainability and interpretability. The third is the applications of the above, including but not limited to Politics and social media, Sustainability and development, Finance applications, Official statistics, Forecasting and nowcasting, Bibliometrics and scientometrics, Social and consumer behaviour, mobility patterns, eWOM and social media marketing, Labor market, Business analytics with social media, Advances in travel, tourism and leisure, Digital management, Marketing Intelligence analytics, and Data governance. The fourth is Digital transition and global society, which, in turn, expects contributions in relation to Privacy and legal aspects, Electronic Government, Data Economy, Smart Cities, and Industry adoption.

In addition to the regular sessions, poster session and two keynotes, a Special JRC session (EC) took place, during which Luca Barbaglia, Nestor Duch Brown, Matteo Sostero and Paolo Canfora presented projects they work on.

Many thanks go to the organizers and sponsors of CARMA2023: Universidad de Sevilla, Cátedra Metropol Parasol, Cátedra Digitalización Empresarial, IBM, Universitat Politècnica de València, Joint Research Center – European Commission and Coca-Cola, who made this event a true success. Enjoyed this experience very much! Excellent venue! Great audience! ¡Muchas gracias!
