Project Docs

AI-generated documentation: 132 projects.

k3s App Builder Platform

A self-contained platform for deploying apps with plain language and Claude, running entirely within k3s on a single EC2 instance.

Python, FastAPI, k3s, Docker
⚠ 2 need attention

Remote Dev UI

A web-based dual-pane file browser with an integrated terminal for managing local and EC2 files via SSH/SFTP.

TypeScript, React, Express, ssh2
⚠ 2 need attention

AWS Cognito + ALB Auth Portal

A minimal employee access portal using AWS ALB for OAuth2 authentication and FastAPI for group-based authorization.

Python, FastAPI, Terraform, AWS Cognito
✓ all on main

Real-Time ASR Service with NVIDIA NeMo Parakeet

A production-ready streaming speech-to-text API using NVIDIA's Parakeet-CTC-0.6B model, featuring WebSocket audio ingestion, Silero VAD, and FastAPI.

Python 3.12, FastAPI, NVIDIA NeMo, PyTorch
✓ all on main

nano-claw voice loop

A local, voice-powered AI agent that runs in your browser with Metal-accelerated speech recognition and tool execution.

TypeScript, Python, Docker, WebRTC
⚠ 3 need attention

4kUpScalerWorker

Serverless-style image upscaling worker using AWS S3, SQS, and InvokeAI.

Python, boto3, AWS S3, AWS SQS
✓ all on main

AWS_tools_for_pricing

Python scripts to export EC2 instance details and OpenSearch domain configurations for cost analysis.

Python 3, boto3, AWS CLI, CSV
✓ all on main

Audio2ScriptViewer

A Python micro-service that processes transcribed audio messages from AWS SQS and compiles them into a sorted CSV script.

Python, boto3, AWS SQS, Terraform
✓ all on main

Auth0 to S3 Backend

A reference architecture for mapping Auth0 authenticated users to individual AWS S3 buckets with secure, pre-signed URL access.

JavaScript, Python, React, Flask
✓ all on main

Browser-Text-to-Speech-TTS-Realtime

A client-side, real-time neural text-to-speech engine using WebAssembly and ONNX.

TypeScript, React 18, Vite, ONNX Runtime Web
⚠ 48 need attention

COE-overview

Weekly aggregator for security and operational errors from Jira, Wiz, CrowdStrike, and Vibranium.

Python, SQLAlchemy, Alembic, Pydantic
✓ all on main

FSM-generic (Voice OS)

A generic voice-first operating system framework that defines conversational apps as workflow state machines.

Python 3.11+, FastAPI, React, TypeScript
✓ all on main

Huberman Lab Podcast Transcripts

A curated collection of MS Word and Markdown transcripts for episodes 1–30 of the Huberman Lab Podcast.

Markdown, Microsoft Word
✓ all on main

Palindrome-Index

A Python utility that identifies the index of a character whose removal converts a string into a palindrome.

Python, Standard Library
✓ all on main
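
The idea in the Palindrome-Index description can be illustrated with a two-pointer scan. This is a minimal sketch, not the repository's actual code: the function name `palindrome_index` and the convention of returning -1 when no single removal helps (or when the string is already a palindrome) are assumptions made for the example.

```python
def palindrome_index(s: str) -> int:
    """Return an index whose removal makes s a palindrome, or -1.

    Hypothetical helper: -1 means "already a palindrome" or "no single
    removal works"; the real project may use a different convention.
    """

    def is_palindrome(lo: int, hi: int) -> bool:
        # Check s[lo..hi] (inclusive) by walking inward from both ends.
        while lo < hi:
            if s[lo] != s[hi]:
                return False
            lo += 1
            hi -= 1
        return True

    lo, hi = 0, len(s) - 1
    while lo < hi:
        if s[lo] != s[hi]:
            # The first mismatch pins the candidate down to one of two indices.
            if is_palindrome(lo + 1, hi):
                return lo
            if is_palindrome(lo, hi - 1):
                return hi
            return -1
        lo += 1
        hi -= 1
    return -1
```

For example, `palindrome_index("aaab")` returns 3, because dropping the trailing "b" leaves "aaa".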

S3 Presigned URL Lambda API Gateway Setup

A Serverless Framework template for generating secure, rate-limited S3 presigned upload URLs via API Gateway and Lambda.

AWS Lambda, Amazon S3, Amazon API Gateway, Serverless Framework
⚠ 2 need attention

TranscriptionAPI-S3-backend

A lightweight Flask microservice that generates AWS S3 presigned URLs for secure audio file uploads.

Python, Flask, Boto3, AWS S3
✓ all on main

UFOBriefing

Static intelligence portal for UAP/UFO disclosure research, powered by the Intelligence Briefing Toolkit.

TypeScript, Next.js, tsx, Node.js
⚠ 1 needs attention

WhisperLive

A real-time speech-to-text server and client ecosystem using OpenAI's Whisper with WebSocket streaming.

Python, JavaScript, WebSockets, PyAudio
✓ all on main

address-validator

FastAPI service that resolves messy/voice-transcribed US addresses to USPS standards via phonetic fuzzy matching and calculates driving routes using OpenStreetMap.

Python, FastAPI, httpx, Jellyfish
✓ all on main

Agent Router

A federated, multi-agent cockpit that spawns Gemini, Claude, and Codex in real PTYs and bridges them to a browser workspace.

Python, FastAPI, uvicorn, websockets
⚠ 10 need attention

art-starry-night-flowfield

WebGPU-accelerated particle flow field visualization inspired by Van Gogh's Starry Night.

JavaScript, WebGPU, WGSL, p5.js
✓ all on main

Audio Orchestrator

A Python-based service that manages audio file processing workflows by coordinating S3 uploads, database task tracking, and SQS worker queues.

Python, Flask, PostgreSQL, AWS SQS
✓ all on main

audio-ui (thin)

Browser-based audio recorder with chunking functionality.

HTML, JavaScript, Web Audio API
✓ all on main

audio-ui-cf-s3-lambda-cognito

A deprecated serverless personal cloud platform featuring secure audio recording and file management via AWS Cognito, Lambda, S3, and CloudFront.

Node.js, AWS Lambda, Amazon S3, Amazon CloudFront
✓ all on main

audio-ui-realtime-transcribe

Serverless platform for browser-based chunked audio recording, S3 storage, and file management with planned real-time transcription.

HTML/CSS/JS, Node.js, AWS Lambda, Amazon S3
✓ all on main

audio-workernode

A Python-based distributed worker node for audio transcription tasks, designed to run on RunPod infrastructure and communicate with a central orchestrator.

Python, Requests, YAML, RunPod SDK
✓ all on main

audio_client_server

A web-based audio recording client that streams sound to a Python Flask server for real-time transcription and processing.

Python, Flask, JavaScript, PyJWT
⚠ 4 need attention

No Prob Bob — Customer Discovery Agent

A conversational AI agent that guides customer discovery interviews, extracts insights, and saves structured seed documents to Afterburner projects.

Python 3.11+, Ollama, Faster-Whisper, Piper TTS
⚠ 1 needs attention

browser-RAG-retrieval-realtime-night-index-SQLLiteWASM-and-sqllite-vec-portal-vector-db-with-filters

A fully offline, in-browser vector database for real-time RAG retrieval using SQLite WASM and client-side embeddings.

TypeScript, SQLite WASM, Transformers.js, Vite
⚠ 6 need attention

Browser-Native RAG Retrieval Layer

A fully local, WebGPU-accelerated RAG retrieval system that runs embedding and vector search entirely in the browser with zero network calls at query time.

TypeScript, Vite, Transformers.js, hnswlib-node
⚠ 4 need attention

VoiceStream ASR

A production-ready real-time Automatic Speech Recognition service using browser APIs and WebSocket streaming.

TypeScript, React, Express.js, WebSocket (ws)
⚠ 2 need attention

browser-llm-local-ai-chat

A zero-build, vanilla JS browser chat interface supporting Ollama, OpenAI, Anthropic, and in-browser WebGPU/WASM inference.

JavaScript, ES Modules, WebGPU, WebAssembly
✓ all on main

Browser Voice Agent

A fully browser-native voice agent using local WebGPU LLMs, TTS, and STT with no server requirements.

React 19, TypeScript, Vite 7, Tailwind CSS v4
✓ all on main

Browser Whisper Models Local Showcase

A client-side web application for private, offline speech-to-text transcription using OpenAI's Whisper models via WebGPU or WebAssembly.

HTML5, JavaScript, WebGPU, WebAssembly
✓ all on main

browser_mobile_debug_panel

One-tap debug report button for mobile web apps that captures console logs, browser info, and app state to S3.

TypeScript, React, AWS Lambda, Amazon S3
⚠ 2 need attention

cal-provider

Unified async Python library for Google Calendar and CalDAV with an optional MCP server.

Python, Asyncio, Google Calendar API, CalDAV Protocol
✓ all on main

Browser-Native RAG Retrieval Layer

A fully local, WebGPU-accelerated RAG retrieval system that runs entirely in the browser with zero network calls at query time.

TypeScript, Vite, Transformers.js, WebGPU
⚠ 4 need attention

classVsInstance

Educational Python implementation demonstrating class vs instance variables and basic OOP logic.

Python 3, OOP
✓ all on main

claude-chat-workspace

Lightweight self-hosted chat interface for Claude using Node.js, Express, and SSE.

Node.js, Express, Docker, Server-Sent Events
⚠ 4 need attention

Claude Memory Bridge

A Chrome extension that detects specific memory markers in Claude AI chats and bridges them to external storage via webhooks or serverless functions.

JavaScript, Chrome Extensions API (Manifest V3), AWS SDK, HTML/CSS
✓ all on main

Cloudflare Zero Trust Setup Wizard

Interactive portal and CLI for provisioning secure, identity-gated localhost tunnels via Cloudflare.

TypeScript, React, Vite, Express
⚠ 2 need attention

codefresh-release-stats

A Python script to count Codefresh Kubernetes releases per cluster over the last week.

Python, Requests, JSON, Datetime
✓ all on main

CloudDrive: Serverless Personal Cloud with Audio Recording

A secure, mobile-first personal cloud platform featuring chunked audio recording, file management, and event-driven architecture on AWS.

AWS Lambda, Amazon S3, Amazon CloudFront, Amazon Cognito
✓ all on main

Deploy Portal (Capsule Cloud)

A self-service web portal for provisioning SSH access and deploying applications to an EC2 gateway instance protected by AWS Cognito authentication.

Python 3.8+, Flask, Boto3, Shell Scripting
✓ all on main

Easy Cognito Nginx Gateway Auth

A production-ready authentication gateway that protects web applications using AWS Cognito, nginx, and oauth2-proxy without requiring code changes.

Bash, Nginx, oauth2-proxy, AWS Cognito
✓ all on main

EventBridge Orchestrator

Infrastructure-as-code repository defining AWS EventBridge schemas, rules, and utility lambdas for a decoupled microservices architecture.

AWS EventBridge, Terraform, Node.js, AWS Lambda
⚠ 2 need attention

everyone-ai

A self-service internal developer platform that launches isolated Claude Code workspaces on Kubernetes via a web portal.

Node.js, Express, Kubernetes (k3s), Helm
⚠ 2 need attention

exampleLoops

Educational code samples demonstrating basic loop structures in Java and Python.

Java, Python
✓ all on main

flowstate

A declarative Finite State Machine engine for multi-turn conversational AI that uses YAML to define states, typed slots, and per-state model selection.

Python, YAML, JSON, Anthropic SDK
✓ all on main

flowstate-evolve

Analyzes Flowstate workflow traces to recommend graph improvements and generate auditable before/after reports.

Python, PyYAML, JSONL, Argparse
✓ all on main

generate_title_headline_hooks

An integrity-checked headline generation pipeline that separates proposal, judgment, and verification to prevent hallucinated quotes and unsupported claims.

Python, FastAPI, Pydantic, SQLite
✓ all on main

GitHub Portfolio Search

Semantic search engine for personal GitHub repositories using embeddings and TF-IDF.

Python, SQLite-vec, sentence-transformers, FastAPI
✓ all on main

GPT-4 & LangChain PDF Chatbot

A Next.js application that enables natural language querying of multiple large PDF documents using GPT-4, LangChain, and Pinecone vector storage.

TypeScript, Next.js, LangChain, Pinecone
⚠ 50 need attention

grassy-knoll

A generic voice-first operating system framework that defines conversational apps as workflow state machines.

Python 3.11+, Anthropic API, JSONL, AsyncIO
✓ all on main

Headshot AI Starter

Open-source SaaS starter for generating professional AI headshots using Astria, Next.js, and Supabase.

TypeScript, React, Tailwind CSS, Shadcn UI
⚠ 2 need attention

hello-world

A minimal tutorial repository for Hello World by David Mar.

✓ all on main

hello-world-test

A minimal GitHub repository used for testing branch edits and README updates.

Git, Markdown
✓ all on main

html-framework-record-ver2.0

A lightweight, web-only audio recorder that captures PCM data and stores it in IndexedDB for persistent local playback.

JavaScript, Web Audio API, IndexedDB, ScriptProcessorNode
✓ all on main

ibt-ufo (thin)

Stub repository for a UAP knowledge graph and MetaPath2Vec correlation engine.

✓ all on main

Intelligence Briefing Toolkit

A standalone research pipeline that searches the web, extracts claims via LLMs, accumulates evidence, and generates intelligence briefings.

Python, FastAPI, Anthropic SDK, Pydantic
⚠ 4 need attention

interviewnotes

A structured framework and checklist for conducting five-part technical and behavioral engineering interviews.

Markdown, Git
✓ all on main

iPhone & Desktop Companion TTS/SST Talking App

Mac-hosted Python voice assistant streaming WebRTC audio to iPhone with a hybrid FSM workflow engine for complex research.

Python, aiohttp, WebRTC, Whisper
⚠ 2 need attention

iPhone Streaming Plus Finite State Machine

Mac-hosted Python voice assistant streaming TTS to iPhone Safari via WebRTC with a hybrid FSM workflow engine for complex queries.

Python, WebRTC, aiohttp, Whisper
⚠ 3 need attention

iphone-webrtc-TURN-speaker-streaming-machost-iphonebrowser

Stream generated audio from a Mac host to an iPhone browser via WebRTC and TURN, featuring a voice agent loop with STT, LLM, and TTS.

Python, TypeScript, WebRTC, WebSocket
⚠ 2 need attention

iran-intel

Static intelligence portal for Iran war investigation signals, powered by the Intelligence Briefing Toolkit.

TypeScript, Next.js, tsx, JSON
✓ all on main

knightsMovesRecursive

A Python script that calculates valid knight moves on a chessboard using recursive logic and coordinate translation.

Python, Standard Library
✓ all on main
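
The two ingredients named in the knightsMovesRecursive description, coordinate translation and recursion, can be sketched as follows. This is an illustrative example under stated assumptions, not the repository's code: the names `to_coords`, `to_square`, `knight_moves`, and `reachable` are hypothetical, and squares are assumed to use algebraic notation such as "a1" on a standard 8x8 board.

```python
# The eight (column, row) offsets a knight can jump.
KNIGHT_OFFSETS = [(1, 2), (2, 1), (2, -1), (1, -2),
                  (-1, -2), (-2, -1), (-2, 1), (-1, 2)]


def to_coords(square):
    """Translate algebraic notation such as 'b1' to zero-based (col, row)."""
    return ord(square[0]) - ord("a"), int(square[1]) - 1


def to_square(col, row):
    """Translate zero-based (col, row) back to algebraic notation."""
    return chr(ord("a") + col) + str(row + 1)


def knight_moves(square):
    """Return the sorted list of squares a knight can reach in one move."""
    col, row = to_coords(square)
    moves = []
    for dc, dr in KNIGHT_OFFSETS:
        c, r = col + dc, row + dr
        if 0 <= c < 8 and 0 <= r < 8:  # discard jumps that leave the board
            moves.append(to_square(c, r))
    return sorted(moves)


def reachable(square, depth):
    """Recursively collect squares reachable in exactly `depth` moves."""
    if depth == 0:
        return {square}
    squares = set()
    for nxt in knight_moves(square):
        squares |= reachable(nxt, depth - 1)
    return squares
```

From the corner, `knight_moves("a1")` yields `["b3", "c2"]`, and `reachable("a1", 2)` contains "a1" again because the knight can jump out and back.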

Liveblocks Next.js Starter Kit

A production-ready template for building collaborative applications with real-time features using Next.js and Liveblocks.

TypeScript, Next.js, React, Liveblocks
⚠ 2 need attention

llm_gpt3.5-vs-gpt4-comparison

A cost-optimized document summarization pipeline using GPT-3.5 that achieves comparable or better results than GPT-4 by leveraging logical paragraph segmentation and hierarchical summarization.

Python, LangChain, OpenAI API, NumPy
✓ all on main

login-system

Serverless authentication backend using AWS Lambda, DynamoDB, and JWT.

JavaScript, AWS Lambda, DynamoDB, JWT
⚠ 4 need attention

mcp-highlighter

A Chrome extension and MCP bridge system that captures LLM-formatted memory blocks from web pages and persists them for AI context.

TypeScript, Node.js, Express.js, Model Context Protocol SDK
⚠ 2 need attention

my-blog (thin)

Placeholder repository for personal blog writing.

✓ all on main

nvidia-RNN-T-Parakeet

A low-latency, GPU-accelerated speech-to-text API server using NVIDIA Parakeet RNN-T and Riva.

Python, Shell Script, FastAPI, NVIDIA Riva
✓ all on main

nvidia-parakeet

Production-ready deployment system for NVIDIA Parakeet RNNT ASR via Riva with WebSocket streaming.

Shell, Python, NVIDIA Riva, FastAPI
✓ all on main

NVIDIA Parakeet Riva ASR Deployment System

Production-ready deployment pipeline for NVIDIA Parakeet RNNT speech recognition via Riva ASR with WebSocket streaming.

Python 3.10+, FastAPI, WebSockets, gRPC
⚠ 1 needs attention

NVIDIA Riva Conformer-CTC Streaming ASR

Production-ready real-time speech transcription using NVIDIA Riva 2.19 with Conformer-CTC streaming model via WebSocket bridge.

Python, Shell, NVIDIA Riva, gRPC
✓ all on main

NVIDIA Riva RNN-T Real-Time Transcription System

Ultra-low latency real-time audio transcription system leveraging NVIDIA Riva RNN-T on AWS GPU instances.

Python, Bash, FastAPI, NVIDIA Riva
⚠ 2 need attention

NVIDIA RNN-T Production Transcription System

Automated deployment of a GPU-accelerated SpeechBrain Conformer RNN-T speech recognition server with WebSocket streaming and S3 integration.

Python, Bash, FastAPI, PyTorch
✓ all on main

openai_transcribe

A lightweight Node.js HTTP server that proxies audio files to the OpenAI Whisper API for transcription.

Node.js, JavaScript, HTTP, OpenAI API
✓ all on main

Open Source Sound Generator LLMs

A local browser-based benchmarking suite for comparing nine open-source audio generation models with real-time progress streaming and Apple Silicon optimization.

Python 3.13+, FastAPI, React, Vite
⚠ 4 need attention

phone-agent-keiko-tv

A Python-based AI phone agent for keiko.tv that handles customer service calls, manages contacts via SQLite, and integrates with FSM-generic for state-driven scheduling.

Python, SQLite, HTTPX, JavaScript
⚠ 1 needs attention

phone-agent-scheduler

A real-time voice AI agent for trades businesses that handles phone calls, checks Google Calendar availability, and books appointments using a deterministic FSM and LLM safety nets.

Python, Flask, Telnyx API, Google Calendar API
✓ all on main

Picture QR Code

A serverless, password-gated shared photo album accessed via QR code, using AWS Lambda and S3.

AWS Lambda, Amazon S3, Python, HTML5
✓ all on main

Picture QR Code Login

Serverless shared photo album with identity-aware login (Google/Email) and per-photo attribution.

AWS Lambda, Amazon S3, Amazon CloudFront, HTML5/JavaScript
✓ all on main

pictures-valentina

Automated pipeline for generating a 16-year school portrait poster from an Apple Photos library.

Python, Pillow, OpenCV, rembg
✓ all on main

pythonArrays2

A Python script that reads an array of integers from standard input and prints them in reverse order.

Python 2, sys module
✓ all on main

rag-document-chat

A Python-based Retrieval-Augmented Generation (RAG) system for document Q&A using FastAPI, Streamlit, ChromaDB, and OpenAI.

Python, FastAPI, Streamlit, ChromaDB
⚠ 2 need attention

RAG Document Chat System v2

A modular, production-ready Retrieval-Augmented Generation system with hierarchical document processing and paragraph-aware search.

Python 3.9+, FastAPI, Streamlit, Next.js
⚠ 4 need attention

recording_app

Browser-based voice memo recorder with chunked S3 streaming and local faster-whisper transcription.

Python, JavaScript, AWS Lambda, AWS S3
⚠ 5 need attention

riff

Voice agent platform that generates deterministic FSM-based voice flows from plain English business descriptions.

Python, YAML, Mermaid, Pytest
⚠ 8 need attention

riot

A lightweight Python socket server and curses-based client for tracking taxi driver performance ratings.

Python 2, SQLite3, TCP Sockets, Curses
✓ all on main

search-tool-provider

Unified async web search interface supporting 6+ backends with MCP server, CLI, and admin UI.

Python, Asyncio, FastAPI, Rich
✓ all on main

sip-voice-transport

PSTN phone call transport for Mac-local voice AI supporting Telnyx and Twilio.

Python 3.11+, FastAPI, NumPy, Uvicorn
✓ all on main

Smart Transcription Router

Hybrid AWS architecture routing audio transcription between real-time GPU FastAPI and batch SQS processing based on server health.

Python, Shell, FastAPI, AWS Lambda
✓ all on main

spa_react_javascript_auth0

Auth0 React SPA authentication sample with auxiliary S3 audio upload utilities.

React, JavaScript, React Router 6, Auth0 SDK
⚠ 2 need attention

Iris Kade: Local Browser-Native Conversational AI

A zero-network, WebGPU-accelerated conversational AI with dual-LLM architecture, multi-lane RAG, and streaming TTS/STT running entirely in the browser.

TypeScript, Vite, WebGPU, Web-LLM
⚠ 2 need attention

Iris Kade v2: Local Browser AI with Intent Workflow

A zero-network, WebGPU-powered conversational AI featuring an embedder-based intent classifier and a visual workflow editor for stateful dialogue.

TypeScript, WebGPU, ONNX Runtime, Vite
⚠ 2 need attention

Speaker Workflow System

A visual editor and runtime engine for declarative, multi-turn conversational workflows defined in JSONL.

TypeScript, Vite, Vitest, Vanilla DOM API
⚠ 2 need attention

spot-setup (thin)

A thin repository intended for setting up GPU spot instances optimized for running Whisper.

Shell
✓ all on main

SSH Helper

A lightweight, web-based SSH terminal emulator with IP whitelisting and external authentication support.

Node.js, JavaScript, Express, ws
✓ all on main

stringsAndLoops

A Python script that separates even and odd indexed characters from input strings.

Python 2, sys module, raw_input, stdout.write
✓ all on main
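
The even/odd split that the stringsAndLoops description mentions can be expressed with extended slicing. This is a minimal Python 3 sketch, not the repository's Python 2 script; the function name `split_even_odd` is an assumption made for the example.

```python
def split_even_odd(text):
    """Return (even-indexed characters, odd-indexed characters) of text.

    Hypothetical helper: a slice step of 2 picks every second character,
    starting at index 0 for the even positions and index 1 for the odd ones.
    """
    return text[::2], text[1::2]


if __name__ == "__main__":
    even, odd = split_even_odd("Hacker")
    print(even, odd)  # prints "Hce akr"
```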

stripe-single-subscription-ec2

A Python Flask backend and vanilla JS frontend implementing Stripe Checkout for single subscriptions, configured for EC2 deployment.

Python 3.8+, Flask, Stripe API, JavaScript
⚠ 2 need attention

SuperTokens S3 Example

A reference implementation of passwordless authentication using SuperTokens, Go, and Docker on AWS EC2.

Go, SuperTokens, Docker, PostgreSQL
✓ all on main

SuperTokens Passwordless Node.js Demo

A minimal prototype demonstrating passwordless authentication via magic links using SuperTokens, Node.js, and Docker.

Node.js, Express, SuperTokens, Docker
✓ all on main

switchboard-ai

Append-only coordination channel for AI agents and humans via HTTP and JSONL.

Python, aiohttp, JSONL, HTTP REST
✓ all on main

FriendNet

A private network dashboard that aggregates mDNS service discovery from multiple Tailscale-connected homes into a single view.

Python 3.9+, FastAPI, React, Vite
⚠ 4 need attention

Persona RAG Engine

A multi-domain intelligence platform that isolates analytical personas and evidence corpora to prevent data bleed while enabling specialized, voice-consistent Q&A.

Python 3.12+, FastAPI, ChromaDB, Anthropic Claude API
⚠ 17 need attention

tool-browserAutomationScripts

macOS CLI and library for listing Chrome profiles, decrypting cookies, and launching browser instances.

TypeScript, Node.js, Commander.js, better-sqlite3
⚠ 2 need attention

tool-messaging-project (thin)

Empty or stub repository for a messaging tool project.

Git
✓ all on main

tool-s3-cloudfront-push

A shell-based static site deployer that crawls a running local website and pushes it to AWS S3 and CloudFront for low-cost, HTTPS-enabled global hosting.

Bash, AWS CLI, Wget/Curl, JSON
✓ all on main

Video Picture Annotator Review

A Playwright-style tool for analyzing screen recordings, annotating UI elements with AI, and generating visual diff reports.

Python, Click, Rich, OpenCV
✓ all on main

whatsup

Multi-transport project messaging for Afterburner sprints via Telegram, WhatsApp (planned), and console.

Python 3.11+, argparse, requests, JSON
✓ all on main

Traceable Searchable ADR Memory Index

A framework for traceable, searchable project memory with multi-agent sprint automation and architecture decision tracking.

Bash, JavaScript, TypeScript, Python
⚠ 2 need attention

Phone Agent Training Pipeline

A pipeline for generating synthetic phone conversation data and fine-tuning local LLMs (via LoRA on Apple Silicon) to act as automated plumbing receptionists.

Python, MLX, LoRA, JSONL
✓ all on main

transcriber-2-pass-riva-conformer-cf-s3-lambda-cognito-adapter-2025-10-14

Hybrid architecture combining NVIDIA Riva Conformer-CTC for real-time streaming ASR with an AWS serverless backend for secure, chunked audio storage and session management.

Python, TypeScript, Shell, gRPC
⚠ 2 need attention

transcription-end-to-end-docs

Documentation and integration guide for a serverless transcription pipeline using AWS EventBridge, Cognito, and Lambda.

AWS EventBridge, AWS Lambda, Amazon S3, Amazon CloudFront
✓ all on main

transcription-realtime-whisper

Production-ready real-time speech transcription system supporting WhisperLive edge proxy and NVIDIA Riva enterprise architectures.

Bash, Python, Node.js, TypeScript
⚠ 3 need attention

CloudDrive: Real-Time Transcription Platform

A production-ready SaaS for real-time speech-to-text using WhisperLive, AWS Cognito, and serverless backend.

HTML5, JavaScript, Node.js 18.x, AWS Lambda
⚠ 2 need attention

transcription-sqs-spot-s3

Scalable AWS-based audio transcription system using SQS queues, EC2 Spot instances, and WhisperX/Voxtral models.

Python, Bash, AWS EC2, AWS SQS
⚠ 2 need attention

ui-audiorecorder-html-sample

Reference implementation for recording independent, playable audio chunks in the browser to avoid WebM container header issues.

HTML5, JavaScript (ES5), MediaRecorder API, IndexedDB
✓ all on main

Voice Calendar Scheduler FSM

A 24/7 voice-driven apartment viewing scheduling assistant using Twilio/WebRTC, an 8-step FSM, and Google Calendar integration.

Python 3.11+, FastAPI, Twilio, WebRTC
✓ all on main

voice-frontend-modules

Composable Python infrastructure for voice AI apps: WebRTC transport, edge authentication, and pluggable STT/TTS/LLM engines.

Python, FastAPI, WebRTC, WebSocket
✓ all on main

Voice Base — Browser-to-Server Voice UI

A minimal, self-contained voice conversation system connecting browser microphone to Claude via WebRTC, using local Whisper for STT and Piper for TTS.

Python 3.11+, FastAPI, aiortc, faster-whisper
✓ all on main

voice-optimal-RAG

Self-contained Dockerized RAG service for semantic document search using LanceDB and Nomic embeddings.

Python, FastAPI, LanceDB, SentenceTransformers
⚠ 2 need attention

Voice Print

Real-time speaker diarization and voice fingerprinting system using multiple embedding models.

Python 3.11+, FastAPI, WebSockets, ONNX Runtime
✓ all on main

website-cloner

A six-phase static site cloning tool that enumerates URLs, downloads assets, rewrites links, detects dynamic content, and deploys to AWS S3.

Node.js, JavaScript, ES Modules, Cheerio
✓ all on main

whisper-runpod

Containerized Faster Whisper transcription service optimized for RunPod with S3 integration.

Docker, Shell Scripting, Faster Whisper, Python
✓ all on main

whisperX-runpod

Scripts and handlers for deploying WhisperX with speaker diarization on RunPod or AWS EC2 GPU instances.

Python, Shell, Docker, FastAPI
✓ all on main

whisperlive-runpod

Deployable real-time speech-to-text service using OpenAI Whisper on RunPod GPU infrastructure.

Python, Docker, WebSockets, RunPod API
✓ all on main

whisperlive-salad

Real-time speech-to-text server using OpenAI Whisper, optimized for Salad Cloud GPU deployment.

Python, WebSockets, Faster-Whisper, TensorRT
✓ all on main

youtube_commercial_detector

A utility that estimates YouTube ad frequency by downloading videos and analyzing their text transcriptions for commercial keywords.

Python, youtube-dl/yt-dlp, Regular Expressions, File I/O
✓ all on main

YouTube Phrase Scanner

A serverless, GPU-accelerated pipeline that downloads YouTube videos, transcribes them with WhisperX, and scans for specific phrases.

Python 3.7+, WhisperX, yt-dlp, ffmpeg
✓ all on main

YouTube Phrase Scanner

Serverless pipeline for downloading, transcribing, and scanning YouTube videos for specific phrases using WhisperX and AWS infrastructure.

Python, WhisperX, Docker, AWS Lambda
✓ all on main