02.07.2026 KI unter Kontrolle

AI under control

Many of our articles have criticized the unpredictability of AI systems and accused those responsible of a lack of transparency. Of course, we want to go even further and call for democratic oversight of such companies and a regulatory body that acts in the public interest to determine which AI applications — not least because of these systems’ energy consumption — should be used at all. But let’s start small. It would already be a step in the right direction if we could verify whether an AI system (at least) adheres to its own specifications (terms of service, task descriptions, etc.).

That is exactly what the open-source software Praxen is designed to do: "it checks whether an AI agent does what it claims to do."

Praxen analyzes the policy specified for the AI system and then examines how the system behaves during operation. As a result, Praxen lists where and what differences exist between the stated requirements and reality. In doing so, Praxen draws on standard procedures used in project management.

Praxen examines all of these questions using the Live AI System and compiles its findings in detailed log files. In doing so, Praxen is not operating in a vacuum but is building upon the existing — albeit sometimes flawed — guidelines for AI systems. For Praxen, these currently include the OWASP Top 10 for LLM Applications 2025, the OWASP Top 10 for Agentic AI Applications 2026, the OWASP Secure MCP Server Development Guide 2026, and the RAISE Framework.

While we certainly cannot expect miracles from Praxen, we at least have clues as to where expectations and reality diverge, and can then continue our search in that direction as human beings. Analyzing the results still requires a fair amount of tact and judgment. In this way, Praxen evens out errors when, in repeated similar situations, the results swing one way or the other. The developers aim to improve this: "The goal is to make them visible, measurable, and recoverable so users can trust the results they receive."
Praxen is a start, a first step, but by no means a substitute for democratic oversight of Big Tech.

Read more https://www.helpnetsecurity.com/2026/06/24/praxen-open-source-ai-agent-behavior-verification/
and the software is free available at https://github.com/open-agent-ai-security/praxen


Category[21]: Unsere Themen in der Presse Short-Link to this page: a-fsa.de/e/3Qy
Link to this page: https://www.a-fsa.de/de/articles/9581-20260702-ki-unter-kontrolle.htm
Link with Tor: http://a6pdp5vmmw4zm5tifrc3qo2pyz7mvnk4zzimpesnckvzinubzmioddad.onion/de/articles/9581-20260702-ki-unter-kontrolle.htm
Tags: #AI #KI #GAFAM #BigTech #Prüfung #AGB #Vorgaben #Aufgaben #Kompetenzen #Trainigsdaten #Wahrheit #Manipulation #künstlicheIntelligenz #Projektcontrolling
Created: 2026-07-02 07:46:07


Kommentar abgeben

For further confidential communication, we recommend that you include a reference to a secure messenger, such as Session, Bitmessage, or similar, below the comment text.
To prevent the use of this form by spam robots, please enter the portrayed character set in the left picture below into the right field.

We in the Web2.0


Diaspora Mastodon Twitter Youtube Tumblr Flickr FsA Wikipedia Facebook Bitmessage FsA Song


Impressum  Privacy  Sitemap