What HUMAIN will buy in 2026: an outside-in read
Why this analysis, and where it comes from
HUMAIN was formally announced in May 2025 as an AI company wholly owned by Saudi Arabia’s Public Investment Fund (PIF)[^1]. In its first year, it issued a series of partnership announcements worth reading literally: Nvidia as the chip supplier of record[^2], AMD as a complementary compute partner[^3], Cisco for the network fabric[^4], and the development of ALLaM as the flagship Arabic foundation model[^5]. Its operational footprint includes a sovereign cloud, in-Kingdom data centers, and AI research teams distributed between Riyadh and global hubs[^10].
The question every regional annotation vendor is asking today: “where do I enter?” The honest answer requires distinguishing between three fundamentally different spend segments. This post separates them — outside-in, built only on public announcements and published executive interviews. I am not inside the procurement cycle. Annota8 is not a HUMAIN vendor of record. I write this because a candid practical read serves a reader looking for a map better than claiming knowledge I do not have.
HUMAIN’s five spend segments
| Segment | Estimated 2026-2027 size | Who takes it | Does a regional annotation vendor enter? |
|---|---|---|---|
| Data center build-out + chips + infrastructure | Tens of billions USD multi-year | Nvidia, AMD, Qualcomm, Groq, Cisco, AWS, Saudi construction contractors | No — this is not our game |
| Training data + curation for ALLaM | Mid-to-high 8-figure USD annually | Data + annotation + curation providers | Yes, accessible segment |
| Safety evaluation + red team + behavior annotation | Mid 8-figure to low 9-figure USD over ~24 months | Specialized eval houses + safety experts + prompt authors | Yes, high-value segment |
| Enterprise AI deployment partnerships | Distributed OpEx, low CapEx | System integrators + digital transformation partners | Partially — as an annotation partner, not as an integrator |
| Internal hiring + research infrastructure | Tens of millions USD annually | Arabic NLP research talent + academic partners | No — this is internal infrastructure |
These are estimates built on reasonable industry ratios for a company of this ambition, not disclosures. They err on the conservative side; the chance that actual numbers are higher exceeds the chance they are lower.
Segment 1: data centers and chips — not our game
The largest spend by a wide margin goes to compute infrastructure. The announced Nvidia partnership alone absorbs GPU volumes comparable to major global operators[^2]. AMD[^3] and Qualcomm[^9] serve as diversification sources to reduce single-vendor risk, Groq handles inference workloads[^8], Cisco handles data center networking[^4], and AWS provides cloud capacity[^7]. Add the construction spend for the in-Kingdom data center facilities, cooling systems, power, and physical security — and the conversation is in tens of billions USD over multiple years.
Any regional annotation vendor claiming access to this segment is either misleading or misunderstanding the field. This is not the data annotation business. The suppliers to this segment are chip manufacturers, hyperscale infrastructure integrators, and Saudi construction contractors. The point that matters for us: the size of this segment does not mean there are “crumbs” available for us inside it — it means HUMAIN’s executive and operational focus in 2026 will be concentrated on executing this build-out, which makes procurement cycles for smaller segments more methodical, not less.
Segment 2: ALLaM training data — this is where we enter
ALLaM is the Arabic foundation model HUMAIN is building (after SDAIA developed it in its earlier iterations)[^6]. Each successive training generation requires major training data expansion. This breaks operationally into three layers:
First, pre-training data — vast volumes of cleaned Arabic text, with comprehensive dialect coverage, genre diversity (news, literature, formal, dialogue, technical, scientific), and deep quality processing. This layer is high-volume but low margin on the specialized annotation side; most of the work is curation, cleaning, and deduplication.
Second, Supervised Fine-Tuning (SFT) data — high-quality, hand-crafted instruction-response pairs requiring researchers at academic linguistic level. This layer is much smaller in volume but the value margin per sample is orders of magnitude higher. This is where Cairo’s academic depth outcompetes alternatives.
Third, RLHF preference data — A/B response pairs with deliberate human preference. This layer shapes the model’s personality, safety, and cultural diplomacy. For an Arabic model at ALLaM’s level, the cultural and dialectal nuances in RLHF preferences are critical in a way that does not apply to an English model.
What a regional annotation vendor can credibly offer here: pre-training data curation at volume, SFT pairs at PhD-grade QA, RLHF samples with dialect coverage. The estimated spend in this segment lands in mid-to-high 8-figure USD annually — meaningful, requires multiple distributors, and rewards vendors who can pass strict quality gates.
Read more about our foundation model solutions and the ALLaM training data resource. Glossary entries for SFT and RLHF.
Segment 3: safety evaluation and red team — highest value per sample
Any LLM with production ambition at ALLaM’s level needs a deep safety evaluation layer before deployment. This covers several work types:
- Eval set construction — questions and scenarios that measure specific capabilities (reasoning, language comprehension, cultural knowledge, instruction following)
- Jailbreak red team labeling — systematic attempts to push the model into producing harmful or misaligned output, with labels for whether the model resisted or failed
- Cultural behavior evaluation — does the model handle sensitive topics in Saudi, Islamic, and Arab contexts appropriately
- Dialect evaluation — does the model understand and respond acceptably across dialects without bias toward Modern Standard Arabic
This segment is smaller in volume than training data but requires an elite tier of workers: language experts, cultural and Islamic studies specialists, cybersecurity experts, legal experts. Likely spend over ~24 months: mid 8-figure to low 9-figure USD. Margin per labor hour is substantially higher than standard annotation segments.
Glossary entries for eval set construction and jailbreak red team labeling.
Segment 4: enterprise AI deployment — partially accessible
HUMAIN aspires to be an engine for AI deployment across Saudi government and enterprise sectors. This generates a chain of application projects: AI for healthcare, education, government services, energy, financial services. Each application project carries a value chain:
- The system integrator (SI) that connects ALLaM to the customer’s systems
- The model adapter that handles sector-specific fine-tuning
- The data provider that supplies sector-specialized training or fine-tuning data
A regional annotation vendor typically enters as the third-layer data provider, sometimes as second-layer fine-tuning partner. Entering as a first-layer SI requires software engineering and project management capabilities most annotation houses do not have. The practical model: partner with Saudi SIs (Elm, Solutions by STC, NTGI) to deliver the data layer for projects they lead, rather than competing with them for the integrator role.
Segment 5: internal hiring + research — not accessible
HUMAIN is hiring Arabic NLP researchers, data scientists, and ML engineers aggressively. This is not a vendor opportunity — this is talent competition. Any regional vendor that has built a substantial researcher layer will lose some of them to HUMAIN in 2026-2027 to higher salaries and sovereign incentives. This is a reality to accept and plan around by building a deep leadership layer, not to deny.
Procurement realism: what HUMAIN requires of its vendors
HUMAIN, as a Saudi-designated entity, operates under semi-formal government procurement constraints even as a legally independent company. The practical consequences:
- MISA license: A foreign vendor needs an investment license from the Ministry of Investment to contract at any meaningful scale
- Saudization preference: Every mid-to-large contract considers the vendor’s Saudi headcount ratio under Nitaqat banding
- IKTVA-style scoring: The local content program that started in energy is spreading as a preference scoring criterion across other sectors. Vendors that domesticate spend earn points in evaluation.
- NSDAI (National Strategy for Data and AI): HUMAIN as an executing arm of part of NSDAI carries the strategy’s direction on sovereignty, national talent, and local value added.
- In-Kingdom processing: HUMAIN’s sensitive data, almost by definition, will not leave the Kingdom. An annotation vendor pursuing HUMAIN work needs real processing capacity inside KSA, not a representative office.
What a small/mid annotation vendor should not claim
Claim integrity is part of long-term commercial survival. What a vendor at our scale should not claim — to HUMAIN or to the market:
- We are not a hyperscaler. We will not provide compute infrastructure.
- We are not a chip partner. We will not compete with Nvidia, AMD, Qualcomm, or Groq on anything.
- We are not a system integrator at Elm, Solutions by STC, or NTGI scale. We will not lead full enterprise digital transformation programs.
- We are not a foundation model research lab. We will not build ALLaM v4. We will not design new model architectures.
- We are not a Saudi headcount supplier at thousands of employees. Our KSA subsidiaries are growing but we are not a solution to Saudi hiring at strategic scale.
What a vendor at our scale can credibly offer
What we actually deliver and what sits within Annota8’s ambition scope:
- Arabic-first, not Arabic-as-an-afterthought: Our operating model was built around Arabic with its dialectal and cultural nuance, not as an extension of an English production line
- PhD-grade linguistic QA: Our Cairo teams include researchers at academic level who can review SFT and RLHF sample quality to academic standards
- Comprehensive dialect coverage: Egyptian, Levantine, Gulf (Saudi + Emirati + Kuwaiti + Qatari + Bahraini + Omani), Maghrebi, Sudanese, Yemeni
- Safety eval authoring: Capacity to construct eval sets and red team samples with genuine cultural awareness for Saudi, Islamic, and Arab contexts
- RLHF preference data: Considered human preference mechanics with documented rubric and auditable trail
- In-KSA processing for sovereign workloads: Through our hybrid structure that absorbs the in-Kingdom processing constraint when required
Read our workforce architecture and data curation methodology.
The honest bottom line
HUMAIN in 2026 will spend the majority of its budget on infrastructure and chips, and no annotation vendor enters that segment. The segments accessible to us — ALLaM training data, safety evaluation, culturally-aware RLHF — are much smaller but they are real, and the value margin per labor hour in them is high. The vendor who enters with modest claims, transparent about what it does not do, builds long-term trust. The vendor that claims hyperscaler capabilities loses credibility in the first conversation.
To repeat: Annota8 is not a HUMAIN vendor of record today. This is general analysis from outside the procurement cycle, built on public announcements. I publish it because honest analysis serves the reader better than claiming knowledge I do not have. If you have information that contradicts anything in this post, I welcome the correction.