Homepage - Nota Shafira Noh

STATUS LOG

// NOTICE: This blog space is currently undergoing architectural and content-focused upgrades. Some articles may temporarily remain as offline drafts.

WACANA & EKSPLORASI // ESSAYS & PAPERS

Latest Publications

AI Safety for Noobs (Public Education)

How I Broke My Own Image Classifier (And What It Taught Me About AI Safety)

I changed exactly one pixel in an image of a "3" — invisible to the human eye — and my neural network confidently declared it was an "8".

May 2026

Reflections

Are We Able to Reflect? — And Why 77% of Us Are Already Too Late

Explore cognitive sovereignty, systemic burnout, and the Islamic framework of Tafakkur from Al-Ghazali as a third path forward. (Written in Malay)

May 2026

AI Safety Research (Evals)

Testing Virtue in Military AI: Can Models Exhibit Compassion Under Pressure?

Running iVAIS evaluationson military datasets with Dr. Masaharu Mizumoto — measuring whether AI exhibits virtues like honesty, humility, and compassion in life-or-death scenarios. Interactive playground included.

Apr 2026

AI Safety for Noobs (Public Education)

Can Mechanistic Interpretability Detect Who Is Asking? (Elderly or Child)

Stop treating models like black boxes. Learn how to map, audit, and steer internal demographic circuits to identify if a child or an elderly person is asking the question.

Jan 2026

AI Governance

From Followers to Influencers: 2 Initial Ways Malaysia Can Set the Agenda for Global AI Governance

Estonia led Europe in digital governance; Singapore shaped model AI guidelines. Here is how Malaysia can leverage compute sovereignty to lead the Global South.

Mar 2025

AI Safety for Noobs (Public Education) [DRAFT LOG]

Linguistic Roots of AI Safety: An Entry Analysis of LLM Value Alignment

Investigating standard value metrics across diverse regional representations and translation constraints.

Feb 2025

Reflections [DRAFT LOG]

AI Safety: Alignment – After 10 Weeks

Reflections on deep system exploration, training mechanics, and alignment benchmarks.

Jan 2025

AI Governance [DRAFT LOG]

Reflection on My AI Governance Application

A journey mapping national frameworks, safety guidelines, and regulatory priorities.

Jan 2025

Peta Penyelidikan // Wacana Kognitif & Pengurusan Sumber Pengiraan