Omar Habra

AI Infrastructure & Compiler Engineer | Agentic Systems, Evals, LLVM

San Francisco, CAomarbro4all@gmail.com(415) 568-7167linkedin.com/in/ohabra github.com/bro4all ohabra.com

Summary

AI infrastructure and compiler engineer with experience across Swift/Clang, LLVM, RISC-V tooling, and production AI systems. Builds agentic workflows, retrieval/search pipelines, tool-use systems, structured outputs, evaluation harnesses, and AWS/Bedrock/Claude infrastructure. Strong where low-level systems depth meets reliable AI product delivery.

Experience

Founder & Senior Engineer — AI Products & Infrastructure

2023 – Present

Independent Consulting

RedlineAI — local-first AI contract review: Built a privacy-first macOS AI workflow for legal-document review with source-grounded extraction, severity-scored findings, and human-reviewable evidence trails.
DiffSwarm — multi-agent code review CLI: Built a Homebrew-installable PR review CLI that coordinates parallel Codex/Claude agents, tool-calling review passes, verifier evidence, and local BYOK execution.
AI Tutoring Platform: Built production AI infrastructure across retrieval (Postgres/pgvector), Bedrock/Claude orchestration, Next.js, Terraform-managed AWS, structured-output validation, and curriculum-aware regression checks.
GenAI Visual Processing: Improved image-pipeline latency and reliability through caching, batching, throughput tuning, and an evaluation harness for model regression detection.

Senior Software Engineer — RISC-V Toolchain

2022 – 2023

SiFive · San Mateo, CA

Improved RISC-V vector workload performance by tuning Clang/LLVM vectorization cost models and RVV codegen heuristics; added microbenchmarks to catch compiler performance regressions.
Integrated LLVM analysis tooling into FreedomStudio IDE and built Python automation for toolchain builds and RISC-V ISA analysis.
Automated cross-compilation, ISA validation, and performance benchmarking through CI pipelines with Ansible-managed environments.

Software Engineer — Clang/Swift Compiler

2020 – 2022

Apple · Cupertino, CA

Worked on Swift–C++ interoperability for the Swift/Clang toolchain, including AST bridging, ABI integration, and name-mangling support for cross-language calls.
Supported Apple Silicon compiler work by resolving ABI and backend issues across Swift, Clang, and LLVM for arm64 targets.
Built cross-architecture CI and log-triage tooling for x86_64/arm64 compiler regressions across Clang, Swift, and LLVM workflows.

Selected Projects

DiffSwarm↗ diffswarm.com

2025 – Present

Local-first multi-agent PR review CLI

Built a Homebrew-installable AI developer tool that reviews local diffs and GitHub PRs with Codex or Claude Code, plans focused review passes, verifies findings, and outputs actionable markdown reports while keeping code local through a BYOK model.

AI developer toolsCodexClaude CodeRustTypeScriptOpenTUIGitHubSecurity reviewBYOK

RedlineAI↗ red-line-ai.com

2026 – Present

Local-first AI contract review for macOS

Building a macOS legal AI workflow that reviews contracts locally with Codex or Claude Code, preserves evidence anchors, verifies findings against source text, and produces reviewable work product for NDAs, vendor agreements, DPAs, MSAs, and SOWs.

Applied AILegalTechmacOSSwiftRustLLM orchestrationSource verificationLocal-first AI

Open Source

Tenstorrent tt-metal↗

Nov 2025

Contributed TTNN model bring-ups to TT-Metal for Tenstorrent Wormhole accelerators, including end-to-end demos, tests, and performance artifacts.

PR #33123: DPT-Large (MiDaS 3.0) depth estimation TTNN implementation (Bounty #31290)
PR #32500: YOLOS-small object detection TTNN implementation (Bounty #30874)
PR #32335: MaskFormer Swin-B hybrid TT decoder + perf artifacts and TT/CPU tests

Swift Programming Language↗

2022

Contributed merged PRs to the Swift compiler around C++ interoperability, including getter/setter support and pointer type handling.

PR #40842: Implemented C++ getters/setters interoperability (Merged Mar 2022)
PR #40276: Fixed pointer type handling in C++ interop (Merged Nov 2021)
PR #58436: Cleanup after devirtualization optimization (Open Apr 2022)

Technical Skills

Compilers & Systems Engineering

LLVM · Clang · Swift compiler · RISC-V · Code generation · Vectorization · ISA optimization · TTNN / TT-Metal · Toolchain development · Performance tuning

Languages & Low-Level Skills

C++ · Python · Swift · TypeScript · JavaScript · Assembly · Computer Architecture · Systems Programming

AI Infrastructure & Agents

Agentic systems · Context engineering · Tool use / function calling · Structured outputs · Retrieval/search (RAG) · Agent harnesses · Evaluation harnesses · Hybrid search · Reranking · pgvector · Embeddings · AWS Bedrock · Claude · OpenAI API · Source-grounded verification · Context engineering · Model inference optimization · Caching strategies · Computer vision

Product & Web Development

React · Next.js · Node.js · GraphQL · REST APIs · ARKit · iOS Development · UI/UX · Full-stack development

Cloud & Infrastructure

AWS ECS Fargate · AWS Step Functions · AWS Lambda · AWS S3 · AWS RDS · Terraform · Docker · CI/CD · GitHub Actions · Ansible

Databases & Storage

PostgreSQL · pgvector · Redis · S3 · Vector databases

Security & Compliance

Multi-tenant architecture · RBAC · Data isolation · AWS Secrets Manager · Audit logging · Privacy-by-design

Observability & Reliability

Monitoring · Tracing · Cost optimization · Incident response · SLA management · Load testing

Education

San José State University

B.S., Computer Engineering

San José, CA · May 2020

Languages

Arabic — Native
English — Fluent

Beyond Engineering

Competitive Endurance Athletics
Interdisciplinary Scholar