Controlling AI with AI. Conative Gating introduces a second model trained with inverted incentives: it is rewarded for blocking, suspicious by default, and adversarial to the LLM’s proposals. The design draws on metaphors from human constraint.
hyperpolymath/conative-gating