Skip to content
@HyperKuvid-Labs

HyperKuvid Labs

Fell into a semicolon, never came back. Chanting git push like a spell, filming our open-source downfall no rank ladders, iykyk đź‘˝

HyperKuvid Labs

A small group(3) of students experimenting with LLMs, inference systems, RL, and CUDA. We build things to understand how they work, not to ship products.

What We Do

We're focused on:

  • LLM inference optimization and serving strategies
  • Reinforcement learning for non-standard problems
  • CUDA programming and GPU-level performance work
  • Building end-to-end systems that combine multiple techniques

Projects

alphadesign — Hybrid AI framework combining reinforcement learning and genetic algorithms to optimize Formula 1 front wing aerodynamic designs. Features neural network-guided optimization, CFD analysis, structural constraint validation, and F1 regulatory compliance checking for accelerated design iteration.

frugalsot — An adaptive model selection system for efficient on-device NLP inference, enhancing speed, privacy, and resource use on edge devices.

phydra — End-to-end cargo management system with advanced 3D bin packing algorithms, A*/Dijkstra pathfinding implemented in C++ for ISS applications.

gideon — Emotion-aware LLM chat interface with Ollama, FastAPI, and React, featuring real-time image generation and project automation via AlphaStack agent. Accelerates full-stack and blockchain dev by 60%, blending local/cloud models for empathetic, context-rich interactions.

specquant — Scalable framework for adaptive LLM serving: classify prompt complexity → select quantized drafts → verify with FP16 target, no model retraining required.

alphastack — A universal, AI-powered development agent that supports any tech stack—battle-tested across 25+ full dev cycles. It intelligently scaffolds and iterates on complex projects with automated feedback loops to accelerate software delivery.

Contributing

If you're working on similar problems or have ideas worth testing, we're open to collaboration. Code-first, lightweight experiments preferred.

Pinned Loading

  1. PHYDRA PHYDRA Public

    End-to-end cargo management system with advanced 3D bin packing algorithms, A*/Dijkstra pathfinding implemented in C++ for ISS applications.

    C++ 1

  2. FrugalSOT FrugalSOT Public

    An adaptive model selection system for efficient on-device NLP inference, enhancing speed, privacy, and resource use on edge devices.

    TypeScript 1

  3. Gideon Gideon Public

    Emotion-aware LLM chat interface with Ollama, FastAPI, and React, featuring real-time image generation and project automation via Alpha<Stack> agent. Accelerates full-stack and blockchain dev by 60…

    Python

  4. AlphaDesign AlphaDesign Public

    Hybrid AI framework combining reinforcement learning and genetic algorithms to optimize Formula 1 front wing aerodynamic designs. Features neural network-guided optimization, CFD analysis, structur…

    Python 1 1

  5. energy_throttling_llms energy_throttling_llms Public

    Energy-aware DDPG RL framework that dynamically optimizes LLM speculative decoding parameters based on real-time hardware metrics (CPU/GPU temps, battery). Maintains 95-98% energy utilization to ma…

    Python

  6. elastic_continual_learning elastic_continual_learning Public

    This idea proposes an adaptive optimization framework designed to mitigate catastrophic forgetting in neural network training by dynamically selecting and applying state-of-the-art continual learni…

    Python

Repositories

Showing 10 of 18 repositories

Top languages

Loading…

Most used topics

Loading…