Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Prism: Privacy-First AI Orchestration
See Prism, a privacy-first AI orchestrator, decompose queries, route to models like Qwen3, Gemini, and Groq, and synthesize responses, all while redacting secrets.
Prism is a privacy-aware AI orchestration layer that decomposes any query into
parallel sub-tasks, routes each to the right model (local Qwen3, Groq, or Gemini)
based on sensitivity, executes them concurrently via a dependency DAG, then
synthesizes a single coherent response — all in one WebSocket-driven interface.
Live demo will show: a real query decomposing into 3 parallel tasks, a DAG
rendering the execution graph in real-time, the privacy gate redacting secrets
before they leave the machine, and a local Qwen3.6-35B running on a laptop GPU
via llama.cpp with MoE CPU offload.
An empty GitHub repository containing no source code or files.
- llamaMeta's open-weights LLM family optimized for high-performance local deployment and custom fine-tuning across 8B to 405B parameter scales.Llama 3.1 delivers state-of-the-art performance through a flagship 405B parameter model trained on 15 trillion tokens. It supports a 128k context window: ideal for analyzing massive datasets or long-form documentation. Developers utilize Llama for diverse tasks (multilingual translation, Python code generation, and complex reasoning) while maintaining data sovereignty via local hosting. The ecosystem includes the Llama Stack for agentic workflows and optimized weights for 8B and 70B models, ensuring high throughput on consumer hardware or enterprise clusters.
- Qwen3Qwen3 is Alibaba Cloud's flagship, open-source LLM series: a high-efficiency model leveraging a Mixture-of-Experts (MoE) architecture and an adaptive Hybrid Thinking Mode.Qwen3 is a powerful, open-weighted LLM (Apache 2.0) from Alibaba Cloud, engineered for peak performance and efficiency. Its core design features a diverse model lineup, including dense models (0.6B to 32B) and efficient MoE variants like the Qwen3-235B (22B active parameters). The key innovation is the Hybrid Thinking Mode, which dynamically toggles between deep, step-by-step reasoning and fast, non-thinking responses. This model supports an extensive 119 languages and handles long-context tasks up to 128K tokens, making it a robust, versatile choice for advanced multilingual and agentic workflows.
- FastAPIFastAPI is a modern, high-performance Python web framework for building APIs with automatic OpenAPI documentation.FastAPI is a robust, high-speed Python web framework: it is built on Starlette (for async capabilities) and Pydantic (for data validation and serialization). Leveraging standard Python 3.8+ type hints, the framework automatically generates interactive API documentation (Swagger UI/ReDoc) and enforces data validation, effectively reducing developer-induced errors by an estimated 40%. This architecture delivers performance on par with Node.js and Go, significantly increasing feature development speed (up to 300% faster). It is production-ready, fully supporting OpenAPI and JSON Schema standards for all API specifications.
- FastAPI WebSocketsFastAPI WebSockets enable bi-directional, real-time communication between clients and servers with native async support and minimal boilerplate.FastAPI WebSockets leverage Starlette and Python's async/await syntax to handle persistent, full-duplex connections out of the box. By defining endpoints with the @app.websocket decorator, developers can easily manage incoming and outgoing data streams (supporting text, binary, and JSON payloads) while utilizing FastAPI's dependency injection system for tasks like authentication. This native integration eliminates the need for complex external wrappers, making it a highly efficient choice for building low-latency features like live chat applications, real-time dashboards, and collaborative tools.
- ReactReact is an open-source JavaScript library for building dynamic user interfaces (UIs).React is a component-based JavaScript library, developed by Meta (Facebook), engineered for building fast, declarative UIs. It mandates a one-way data flow and utilizes a Virtual DOM mechanism to ensure efficient, predictable updates to the user interface. Developers construct complex UIs by composing small, encapsulated components; this architecture promotes code reusability and simplifies state management across large applications. The library employs JSX (a syntax extension) to integrate HTML-like markup directly within JavaScript logic, supporting development for both web (React DOM) and native mobile platforms (React Native).