Speech Software Engineer

201-500

United States

Location

New York, United States

Salary

$215,000 – $235,000 (Yearly)

What you'll do

Architect & Modernize: Lead the design and implementation of a scalable, high-availability voice infrastructure that replaces legacy systems.
Optimize Performance: Build and refine multi-threaded server frameworks capable of handling thousands of concurrent, real-time audio streams with minimal jitter and latency.
Build for Scale: Deploy robust ASR > LLM > TTS pipelines that process thousands of calls concurrently.
Stream Engineering: Develop robust logic for handling media streams, ensuring seamless audio data flow between clients and our ML models.
System Observability: Build advanced monitoring and load-testing tools specifically designed to simulate high-concurrency voice traffic.
Collaborate: Partner with Speech Scientists and Research Engineers to integrate state-of-the-art models into a production-ready environment.

What you'll need

Experience: 5+ years of software engineering experience, with a proven track record of building and maintaining production-grade infrastructure.
Industry Knowledge: A background in building ASR/TTS products at scale that interact with foundational LLMs.
Language Mastery: Expert-level proficiency in Golang or Python, or a willingness to learn them.
Voice Fundamentals: Deep understanding of audio processing, including sample rates, codecs (Opus, G.711), network protocols, and buffering strategies.
System Design: Strong background in object-oriented design and the ability to architect systems that are both modular and performant.
Growth Mindset: The ability to navigate and refactor large existing codebases while transitioning to new, more efficient architectures.

What we'd like to see

Cloud Native: Hands-on experience with Kubernetes, Docker, and cloud providers (AWS/GCP/Azure) for deploying distributed speech services.
Event-Driven Architecture: Familiarity with event loops (Boost.Asio, uvloop) and asynchronous programming patterns.
Big Data: Experience with Hadoop, Spark, or Hive for analyzing massive datasets of speech logs to improve model accuracy.

$215,000 - $235,000 a year

The compensation includes salary plus performance bonus. The actual salary may be different depending upon non-discriminatory factors such as qualifications, experience, and other factors permitted by law.

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at careers@asapp.com to obtain assistance.

ASAPP is hiring a Speech Software Engineer. Apply through The Homebase and make the next move in your career!