The proliferation of machine log data has the potential to give organizations unprecedented real-time visibility into their infrastructure and operations. With this opportunity comes tremendous technical challenges around ingesting, managing, and understanding high-volume streams of heterogeneous data. The Data Collection team owns the ingestion pipeline -- starting with a lightweight agent to collect, compress, encrypt, and ship the data back to the Sumo Logic cloud.
You will be responsible for designing and implementing advanced mechanisms to collect massive amounts of machine-generated data from heterogeneous systems in real-time. You will build asynchronous systems with high levels of concurrency, multithreading, and parallel programming. The Data Collection team is responsible for managing the data collection infrastructure and collection agents. Individual agents collect at rates of tens of thousands events per second.
You are a strong software engineer who is passionate about large-scale systems. You care about producing clean, elegant, maintainable, robust, well-tested code; you do this as a member of a team, helping the group come up with a better solution than you would as individuals. Ideally, you have experience with performance, scalability, and reliability issues of 24x7 commercial services.
Responsibilities
Requirements
Desirable