About the Team:
Zark Lab is building foundation models for blockchain transactions and information. Our work focuses on search, retrieval, and generative modeling applied to on-chain and off-chain data. We are developing systems that enable efficient indexing, retrieval-augmented generation (RAG), and vector search for structured and unstructured blockchain datasets. Our models process billions of transactions and smart contracts across multiple blockchains, applying sequence modeling, graph-based learning, and language models to extract insights and improve data accessibility.
The team consists of former founders, senior engineers, and executives from Google, Meta, Goldman Sachs, and other leading technology and financial institutions. Our backgrounds span large-scale distributed systems, machine learning, engineering, and information retrieval, and we are focused on advancing the state of AI-driven search and computation for blockchains.
What you will do:
- Build large-scale web scraping and ingestion pipelines for on-chain and off-chain blockchain data
- Develop and optimize search architectures, integrating vector search, ANN retrieval, and ranking models
- Fine-tune LLMs for query expansion, semantic search, and retrieval-augmented generation (RAG)
- Reduce query latency through index optimization, ANN search, and distributed execution
- Scale distributed indexing pipelines for efficient storage, deduplication, and retrieval
- Optimize distributed storage and compute with Snowflake, ClickHouse, RocksDB, and vector databases
- Build scalable systems to process high-throughput blockchain transactions and queries
- Deploy and optimize cloud workloads on GCP with Kubernetes and containerized processing
You might thrive in this role if you:
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of experience in AI/ML, distributed search, or large-scale data processing.
- Strong programming skills in Python, TypeScript, or Node.js.
- Expertise in database design (SQL and NoSQL) and high-throughput data systems.
- Experience with web crawling, data scraping, and large-scale ingestion pipelines.
- Knowledge of vector search, retrieval-augmented generation (RAG), and embedding models (preferred).
- Hands-on experience with GCP, Kubernetes, and Docker for cloud-scale deployment.
- Passion for blockchain, AI search, and distributed systems.
Why join us?
- Work on cutting-edge generative AI and search technologies applied to blockchain
- Solve complex challenges in large-scale indexing, vector search, and AI-powered retrieval
- Be part of a high-caliber team of former founders, engineers, and executives from leading tech and financial firms
- Competitive salary and equity opportunities
- Fully remote team with a fast-moving, high-impact culture
- Own and shape the future of AI-driven search and retrieval for Web3
If you’re excited about search, generative AI, and real-time blockchain inference, we’d love to hear from you.
Listed in: Cryptocurrency Jobs, Engineering Crypto Jobs, AI Crypto Jobs, Machine Learning Crypto Jobs, Remote Web3 Jobs, Web3 Web3 Jobs.