70,000 Books from Archives and Zero Budget Hryvnias: How Ukraine Is Building Its Own AI

"Siyvo" — the first national large language model — is being trained on archival texts that no AI project has previously touched. But the most unusual thing about this state project is that the state doesn't fund it.

By Tetiana Suchkova-Ladik

April 10, 2026 · 2 min read

70,000 Books from Archives and Zero Budget Hryvnias: How Ukraine Is Building Its Own AI — Ілюстративне фото: Depositphotos

The State Archives of Ukraine (Ukrderzharkhiv) has transferred approximately 10 terabytes of data to train the AI model "Siayvo" — equivalent to roughly 70,000 books. According to Acting Minister of Digital Transformation Oleksandr Bornyakov, a significant portion of these materials has never been used in similar projects before. To put this in perspective: the entire English-language Wikipedia weighs approximately 21 GB — the archive transferred nearly five times more.

A State Project Without State Funding

The most unusual detail about "Siayvo" is its financing scheme. Kyivstar is covering all development costs, after which the model will be transferred to the state. As Bornyakov explains in a column for AIN, the logic is straightforward: "in the conditions of war, every budget hryvnia must go to defense". In return, the operator receives a reputational and commercial asset — and priority access to the model.

An open technical foundation was chosen: Gemma 3 from Google, which the Digital Transformation Ministry team will refine together with Kyivstar using Ukrainian data. This same architecture already served as the basis for the first Ukrainian LLMs — MamayLM and Lapa LLM, as well as the Bulgarian BgGPT. In other words, "Siayvo" is not built from scratch, but rather a deep adaptation of an existing open model to the language and context.

50+ Organizations and the Paper Problem

Over 50 organizations have already joined the initiative — businesses, media, universities, and research institutions. The Digital Transformation Ministry continues an open call for partners: seeking news, textbooks, scientific literature, fiction, and archival materials.

"The most important part of the work is data preparation. For an effective Ukrainian model, we need not just internet texts, but also historical archives and other written sources."
Sud.ua, on preparing the "Siayvo" dataset

However, there is a specific problem: a significant portion of materials still exists only on paper. Digitizing archives, which in peacetime would have been a matter of convenience, has suddenly become critical for the model's quality.

The Name Was Chosen by 136,000 People

"Siayvo" won the vote in the "Diia" app among over 136,000 participants — with a result of 22,601 votes from ten finalists selected from over 3,000 proposals. The margin from second place was approximately three thousand votes.

Open beta testing for everyone is planned for the end of spring 2026. The long-term goal is more ambitious: by 2030, Ukraine wants to enter the global top-3 in AI development.

The real question, which will be answered during beta testing: will 10 TB of archival texts — combined with the rest of the dataset — provide sufficient quality understanding of context for "Siayvo" to surpass publicly available models precisely where they traditionally fail: in the nuances of Soviet bureaucratic legacy, dialects, and documents that never made it to the internet.

Technologies

Ferrari Luce for €550,000: The Brand's First Electric Car Was Developed for Five Years — and It Has No Touchscreen

May 26, 2026

Technologies

Delete — and it will disappear from OneDrive too. But only until September 2026

May 26, 2026

Technologies

Terra A2: Japanese radar-integrated interceptor deployed to the front — Shahed will meet it 75 km from target

May 25, 2026

Technologies

Samsung Galaxy S26 FE: case reveals more than manufacturer intended

May 25, 2026

Latest

Politics

Britain prepares sanctions against Russia's financial sector — Zelensky's office

Presidential envoy confirms: London preparing new restrictions following criticism over weakened oil price cap. Whether control mechanism will be implemented remains key question.

May 26, 2026

Society

Army deserter opens fire on court officers who came to return him

# Translation A 39-year-old serviceman who illegally left his unit encountered military police representatives with two revolver shots. The Desnyansky incident in broader context: after August 30, 2025, amnesty return from the Armed Forces became legally impossible.

May 26, 2026

Technologies

Ferrari Luce for €550,000: The Brand's First Electric Car Was Developed for Five Years — and It Has No Touchscreen

Ferrari has officially unveiled the Luce, a four-door, five-seater electric vehicle with 1,000 horsepower priced from €550,000. The design is by Jony Ive, it features physical buttons, and Ferrari's stock fell 3% on the day of the presentation.

May 26, 2026

Technologies

Delete — and it will disappear from OneDrive too. But only until September 2026

Samsung and Microsoft are removing the built-in Gallery synchronization with OneDrive. It's not a disaster, but there's one behavioral nuance that few people know about: after switching to a separate application, deleting photos from your smartphone will no longer erase them from the cloud.

May 26, 2026

Business

EU Against Google: Why the Latest Fine Could Change More Than Previous Ones

# European Regulators Target Google Again — This Time Over Digital Markets Act Violations. What's Behind the Accusations and Why It Matters Beyond the Corporation European regulators have renewed their scrutiny of Google, this time focusing on alleged violations of the Digital Markets Act. The charges underscore Brussels' increasingly aggressive stance on big tech monopolies and what officials say are anticompetitive practices. The accusations center on how Google leverages its dominance across multiple digital services — from search to advertising to mobile platforms — to disadvantage competitors. Regulators claim the company is using its market power in ways that stifle innovation and limit consumer choice. The case carries significance far beyond Google itself. It signals how the EU is attempting to enforce its landmark Digital Markets Act, legislation designed to curb the gatekeeping power of tech giants. A potential penalty could set precedent for how other large technology companies face similar scrutiny. For consumers and smaller tech firms, the outcome could reshape the digital landscape by creating more room for competition. For Google, fines and operational restrictions could fundamentally alter its business model in Europe, the world's most stringent regulatory market. The case also reflects a broader geopolitical divide, with the EU pursuing a regulatory approach that contrasts sharply with the lighter-touch oversight favored in the United States.

May 26, 2026

EveryNews

70,000 Books from Archives and Zero Budget Hryvnias: How Ukraine Is Building Its Own AI

A State Project Without State Funding

50+ Organizations and the Paper Problem

The Name Was Chosen by 136,000 People

Related

Ferrari Luce for €550,000: The Brand's First Electric Car Was Developed for Five Years — and It Has No Touchscreen

Delete — and it will disappear from OneDrive too. But only until September 2026

Terra A2: Japanese radar-integrated interceptor deployed to the front — Shahed will meet it 75 km from target

Samsung Galaxy S26 FE: case reveals more than manufacturer intended

Latest

Britain prepares sanctions against Russia's financial sector — Zelensky's office

Army deserter opens fire on court officers who came to return him

Ferrari Luce for €550,000: The Brand's First Electric Car Was Developed for Five Years — and It Has No Touchscreen

Delete — and it will disappear from OneDrive too. But only until September 2026

EU Against Google: Why the Latest Fine Could Change More Than Previous Ones