Methodological foundations

How we conduct research

„Which standards make research results verifiable beyond the lab that produced them?"

A piece of research is only as robust as the methodology that produced it. This page documents the operational standards of our work — from the pre-registration of individual research decisions to the external validation pathways against which we measure ourselves.

This page is written for research institutions, funding bodies and public authorities who want to assess the methodological rigour we apply. It extends the methodology overview on the research page with the operational detail.

Methodological standards do not emerge from a single manifesto. They emerge from concrete decisions made in day-to-day research. We document here those decisions that make our work verifiable by third parties.

The research process

From hypothesis to publication

Every research output passes through the same process — iterative, not linear. Ablation and negative results feed back into hypothesis revision.

Pre-registrationOpenTimestamps anchor

ReproducibilityCode · data · configs

External validationPeer review · opinions

PublicationOpen access · DOI

↻ Ablation & negative results → hypothesis revision

Pre-registration

Anchoring research decisions in time before the work begins

Central research documents are time-anchored via OpenTimestamps. The cryptographic hash of a document is committed to a public blockchain, which creates a priority record independent of any later publication date and verifiable without involvement of the author.

OpenTimestamps is open infrastructure, not an institutional intermediary. Third parties can verify the timestamps later without any cooperation from the researcher.

What we register before the work starts

Research questions and hypotheses before data collection
Architectural decisions together with their reasoning
Intended evaluation procedures, including thresholds and success conditions
Falsification conditions — what would refute the hypothesis

Reproducibility

Replication as default, not an add-on

Research findings are only robust if third parties can reproduce them under clearly stated conditions. For us, replication material belongs with the publication, not in a later appendix.

Software is released under open-source licences (Apache 2.0 for ZenBrain, a mix of Apache 2.0 and MIT for evaluation scripts). Configurations, hyperparameters and seeds live in the code repository — so that a forgotten hyperparameter cannot render a study irreproducible.

Where our replication materials are publicly hosted

ZenBrain source@zensation/algorithms, @zensation/core Preprint with replication materialarXiv 2604.23878 Persistent datasets and scriptsZenodo DOI 10.5281/zenodo.19353663

External validation

Who we submit our results to for scrutiny

Research that certifies itself has not been examined. We build the architecture so that external validation is possible at several points — not only after the work is finished, but along its phases.

Which validation pathway we use depends on the subject matter. For algorithmic contributions, peer review is the academic standard. For safety-relevant components, notified-body pre-assessments and legal opinions on fundamental-rights compatibility come into play.

Academic peer review

Follow-up publications are being prepared for peer-reviewed venues. While that work is underway, preprints are publicly available on arXiv and Zenodo, so that the contributions remain visible independently of any review status.

Notified-body pre-assessments

For the civil-liberties architecture of the Public-Safety track we have held preparatory talks with notified bodies. TÜV SÜD, TÜV Rheinland and Bureau Veritas have experience with conformity assessments under the EU AI Act. These exchanges are non-binding; once the system is used in production, they would be transferred into a formal conformity assessment.

Legal opinions

Architectural decisions with fundamental-rights implications are reviewed by lawyers. The assessment focuses in particular on compatibility with GDPR Art. 89, Regulation (EU) 2024/1689 Art. 5, and the Brokdorf doctrine of the Federal Constitutional Court (BVerfGE 69, 315).

Endorsement procedures of academic identifiers

The arXiv endorsement procedure carried out by established researchers is a form of upstream validation: an endorser confirms that a contribution meets the academic standard of the relevant category. ORCID, Semantic Scholar and Google Scholar link the publication with verifiable researcher profiles.

Negative results and ablation

What did not work is part of the result

Research that only publishes successes is methodologically incomplete. Ablation studies show which components contribute what — and whether a finding is the result of a single architectural decision or of their interplay.

Our ablation register documents hypotheses that were refuted and architectural variants we discarded. This makes it traceable why the final architecture takes the form it does, rather than another.

Ablation registry in the repository

In the internal development repository, backend/src/algorithms/ablation.ts is maintained as a feature-flag registry. It allows individual published algorithms to be disabled for comparative studies — for instance to isolate the contribution of a specific memory layer.

Documented discards

Architectural variants we examined and discarded — certain reranker configurations or rejected memory topologies, for instance — are documented with their rationale. Later revisiting of a discarded path is therefore a conscious decision, not an oversight.

Publication of negative findings

Follow-up publications will explicitly flag negative and counterintuitive findings instead of editing them out in favour of a smoother narrative. Replications that show divergent results will be linked from the publications page.

Data minimisation

Methodological reduction to what is necessary

Data minimisation is not only a data-protection obligation under GDPR Art. 5(1)(c); it is a methodological decision. Annotation schemes that capture more than the research question requires conflate objects of study and undermine the explanatory value of the findings.

In the Public-Safety track, in practice: we work with skeleton and motion data, not with facial features. Trajectories are aggregated within bounded analytical windows, not chained across spaces or time. No biometric templates are created.

Annotation at construct level

Annotations capture the construct addressed by the hypothesis — not everything an annotator might observe. Inter-rater reliability with a Cohen's κ ≥ 0.61 target safeguards the quality of the operationalisation.

Pseudonymisation in research data

Research processing under GDPR Art. 89 is pseudonymised wherever technically possible. Re-identifiability is limited to what the research question actually requires.

Three-outlier validation

Selection decisions are repeatedly validated against outliers — three independent validation steps before data is used. This prevents biases in sample selection from quietly entering the architecture.

External standards

Which standards we measure ourselves against

These standards are not ours — they are established in their respective communities. We orient our methodology to them.

FAIR principles

Findable, Accessible, Interoperable, Reusable (Wilkinson et al., 2016). Research data and software are stored so that they are discoverable via persistent identifiers (DOI, ORCID, GitHub) and remain reusable under Apache 2.0 or CC BY 4.0.

ML reproducibility checklists

The reproducibility checklists established in the ML discipline — maintained by JMLR, ICML and comparable venues — are the reference point for reporting in publications: data splits, seeds, hyperparameters, compute budget, confidence intervals.

EU AI Act Art. 50

Outputs of AI-supported systems are labelled in accordance with Art. 50 of Regulation (EU) 2024/1689 once they are presented to natural persons. Labelling is not a downstream patch but part of the output pipeline.

GDPR Art. 89 in conjunction with BDSG § 27

Research processing is carried out under the safeguards of GDPR Art. 89: pseudonymisation, data minimisation, purpose limitation, technical and organisational measures.

Open access standards

Publications appear as preprints on arXiv (CS.AI) and are persistently anchored via DOI on Zenodo. Software is released under Apache 2.0, replication material under CC BY 4.0.

Open methodological questions

Where our methodology is still being developed

Methodological maturity is a process. We name openly where our standards are still evolving.

External replication

The ZenBrain algorithms are public, but independent replications by third parties are still outstanding. As soon as such replications become available we will link them from the publications page — including those that report divergent findings.

Inter-rater reliability for safety-relevant constructs

Annotation schemes for the Public-Safety track are currently being developed internally. External annotation by independent annotators — for instance within the framework of a consortial project — would be methodologically desirable.

Long-term studies

The retention curves modelled in our memory algorithms are validated in the short- to medium-term range. Multi-year follow-ups require research infrastructure we cannot provide on our own — collaborations with universities or non-university research institutions would be a methodologically appropriate route here.

Confidence intervals in more complex pipelines

For Bayesian confidence propagation we report 95% confidence intervals. For more complex pipeline stages — such as GraphRAG retrieval accuracies under realistic data distribution — CI reporting is still being expanded.

Methodological discussion

Inquiries on methodology and validation

We are happy to answer concrete methodological questions — for instance about replication material, ablation studies or validation designs. For those interested in joint methodological work, we welcome exploratory conversations.

Send a methodology inquiry

How we conduct research

„Which standards make research results verifiable beyond the lab that produced them?"

The research process

From hypothesis to publication

Every research output passes through the same process — iterative, not linear. Ablation and negative results feed back into hypothesis revision.

Pre-registrationOpenTimestamps anchor

ReproducibilityCode · data · configs

External validationPeer review · opinions

PublicationOpen access · DOI

↻ Ablation & negative results → hypothesis revision

Pre-registration

Anchoring research decisions in time before the work begins

OpenTimestamps is open infrastructure, not an institutional intermediary. Third parties can verify the timestamps later without any cooperation from the researcher.

What we register before the work starts

Research questions and hypotheses before data collection
Architectural decisions together with their reasoning
Intended evaluation procedures, including thresholds and success conditions
Falsification conditions — what would refute the hypothesis

Reproducibility

Replication as default, not an add-on

Research findings are only robust if third parties can reproduce them under clearly stated conditions. For us, replication material belongs with the publication, not in a later appendix.

Where our replication materials are publicly hosted

ZenBrain source@zensation/algorithms, @zensation/core Preprint with replication materialarXiv 2604.23878 Persistent datasets and scriptsZenodo DOI 10.5281/zenodo.19353663

External validation

Who we submit our results to for scrutiny

Academic peer review

Notified-body pre-assessments

Legal opinions

Endorsement procedures of academic identifiers

Negative results and ablation

What did not work is part of the result

Our ablation register documents hypotheses that were refuted and architectural variants we discarded. This makes it traceable why the final architecture takes the form it does, rather than another.

Ablation registry in the repository

Documented discards

Publication of negative findings

Data minimisation

Methodological reduction to what is necessary

Annotation at construct level

Pseudonymisation in research data

Research processing under GDPR Art. 89 is pseudonymised wherever technically possible. Re-identifiability is limited to what the research question actually requires.

Three-outlier validation

External standards

Which standards we measure ourselves against

These standards are not ours — they are established in their respective communities. We orient our methodology to them.

FAIR principles

ML reproducibility checklists

EU AI Act Art. 50

GDPR Art. 89 in conjunction with BDSG § 27

Research processing is carried out under the safeguards of GDPR Art. 89: pseudonymisation, data minimisation, purpose limitation, technical and organisational measures.

Open access standards

Publications appear as preprints on arXiv (CS.AI) and are persistently anchored via DOI on Zenodo. Software is released under Apache 2.0, replication material under CC BY 4.0.

Open methodological questions

Where our methodology is still being developed

Methodological maturity is a process. We name openly where our standards are still evolving.

How we conduct research

From hypothesis to publication

Anchoring research decisions in time before the work begins

Replication as default, not an add-on

Who we submit our results to for scrutiny

Academic peer review

Notified-body pre-assessments

Legal opinions

Endorsement procedures of academic identifiers

What did not work is part of the result

Ablation registry in the repository

Documented discards

Publication of negative findings

Methodological reduction to what is necessary

Annotation at construct level

Pseudonymisation in research data

Three-outlier validation

Which standards we measure ourselves against

FAIR principles

ML reproducibility checklists

EU AI Act Art. 50

GDPR Art. 89 in conjunction with BDSG § 27

Open access standards

Where our methodology is still being developed

External replication

Inter-rater reliability for safety-relevant constructs

Long-term studies

Confidence intervals in more complex pipelines

Inquiries on methodology and validation

Related pages

How we conduct research

From hypothesis to publication

Anchoring research decisions in time before the work begins

Replication as default, not an add-on

Who we submit our results to for scrutiny

Academic peer review

Notified-body pre-assessments

Legal opinions

Endorsement procedures of academic identifiers

What did not work is part of the result

Ablation registry in the repository

Documented discards

Publication of negative findings

Methodological reduction to what is necessary

Annotation at construct level

Pseudonymisation in research data

Three-outlier validation

Which standards we measure ourselves against

FAIR principles

ML reproducibility checklists

EU AI Act Art. 50

GDPR Art. 89 in conjunction with BDSG § 27

Open access standards

Where our methodology is still being developed

External replication

Inter-rater reliability for safety-relevant constructs

Long-term studies

Confidence intervals in more complex pipelines

Inquiries on methodology and validation

Related pages