We are looking for someone to lead our team of Site Reliability Engineers to develop automated techniques which harden our services, improve system up-time, and supplement our rapid-deployment model. This is a multi-hat role which will require both strong diagnostic skills paired with robust coding acumen. We design, develop and maintain a repository of 'infrastructure' software that lays the foundation for our engineers to rapidly iterate, automates system deployment, monitors system status, and assists consumers of our services in the deploying their analytics to our platform. Additionally we diagnose, track, and resolve operational issues as they arise, to keep the system stable and available.
- TO BE CONSIDERED FOR THIS POSITION YOU MUST HAVE AN ACTIVE TS/SCI W/ FULL SCOPE POLYGRAPH SECURITY CLEARANCE (U.S. CITIZENSHIP REQUIRED)
The ideal candidate draws upon a good foundation of programming experience using JVM languages (Kotlin/Java), Python, or GoLang, has fairly extensive Linux experience, and familiarity with Docker, PKI-based security, and networking. Any combination of experience with container orchestration architectures (Mesos/Marathon, Kubernetes), databases(SQL, NoSQL), web services, monitoring solutions(ELK, TICK), HDFS (Hadoop/Accumulo/Zookeeper), NiFi, configuration scripting(Salt/Puppet) and/or experience as a technical lead for a small team would all weigh heavily as assets.
Fourteen (14) years experience as a SE in programs and contracts of similar scope, type and complexity is required. Bachelor’s degree in System Engineering, Computer Science, Information Systems, Engineering Science, Engineering Management, or related discipline from an accredited college or university is required. Five (5) years of additional SE experience may be substituted for a bachelor’s degree.
-Contribute to the development of sections of systems engineering documentation such as System Engineering Plans, Initial Capabilities Documents, Requirements specifications, and Interface Control Documents
-Manage system requirements and derived requirements to ensure the delivery of production systems that are compatible with the defined system architecture(s)
-Assist with the development of system requirements, functional requirements, and allocation of the same to individual hardware, software, facility, and personnel components
-Coordinate the resolution of action items from Configuration Control Board (CCB) meetings, design reviews, program reviews, and test reviews that require cross-discipline coordination
-Participate in an Integrated Product Team to design new capabilities based upon evaluation of all necessary development and operational considerations
-Participate in the development of system engineering documentation, such as System Engineering Plans, Initial Capabilities Documents, Requirements Specifications, and Interface Control Documents
-Participate in interface definition, design, and changes to the configuration between affected groups and individuals throughout the life cycle
-Allocate real-time process budgets and error budgets to systems and subsystem components
-Derive from the system requirements an understanding of stakeholder needs, functions that may be logically inferred and implied as essential to system effectiveness
- Derive lower-level requirements from higher-level allocated requirements that describe in detail the functions that a system component must fulfill, and ensure these requirements are complete, correct, unique, unambiguous, realizable, and verifiable
-Generate alternative system concepts, physical architectures, and design solutions
- Participate in establishing and gaining approval of the definition of a system or component under development (requirements, designs, interfaces, test procedures, etc.) that provides a common reference point for hardware and software developers
- Define the methods, processes, and evaluation criteria by which the systems, subsystems and work products are verified against their requirements in a written plan
-Develop system design solution that satisfies the system requirements and fulfills the functional analysis
-Develop derived requirements for Information Assurance Services (Confidentiality, Integrity, Non repudiation, and Availability); Basic Information Assurance Mechanisms (e.g., Identification, Authentication, Access Control, Accountability); and Security Mechanism Technology (Passwords, cryptography, discretionary access control, mandatory access control, hashing, key management, etc.)
- Review and provide input to program and contract work breakdown structure (WBS), work packages and the integrated master plan (IMP)