Job Opportunities at Facebook
Facebook is hiring! We are looking for a Systems Engineer/Architect and a Site Reliability Engineer. The job descriptions are included below. If you are interested, please contact Michelle Bostock at mbostock-at-facebook.com. Thanks and Happy Holidays!
Systems Architect
Palo Alto, CA
Description
Facebook is seeking a seasoned Systems Architect to join the Operations team. The position is full-time, is based in our main office in downtown Palo Alto, and reports to the Manager of Systems Operations.
Responsibilities
* Analyze application flow and infrastructure design to improve performance and scalability of the site
* Collaborate on design of services infrastructure from servers to networking
* Monitor, analyze, and make recommendations as appropriate to improve site stability and availability
* Evaluate hardware and software technologies to improve site efficiency and performance
* Troubleshoot and solve issues with hardware, applications, and network components
* Lead team efforts from design to implementation, prioritize tasks and resources while interacting with Engineering and Operations
* Document current and future configuration processes and policies
* Participate in 24x7 on-call support
Requirements
* B.S. in Computer Science or equivalent experience
* 4+ years of experience in Operations with large web farms
* Extensive knowledge of web architecture and technologies, including Linux, Apache, MySQL, PHP, TCP/IP, security, HTTP, LDAP and MTAs
* Strong background/interest in application and infrastructure design
* Scripting and programming skills
* Excellent verbal and written communication skills
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Site Reliability Engineer
Palo Alto, CA
Description
Facebook is seeking talented operations engineers to join the Site Reliability Engineering team. The ideal candidate will have strong communication skills, a passion for tinkering with Linux, and an almost insane fondness for fast-paced, seat-of-your-pants troubleshooting and crisis management. The position is full-time and is based in our main office in downtown Palo Alto. This position reports to the Manager of Site Reliability Engineering.
Responsibilities
* Monitor the stability and performance of the website
* Remotely troubleshoot and diagnose hardware problems
* Debug issues with Linux software, applications and network
* Resolve technical challenges encountered in LAMP technologies
* Develop and maintain monitoring tools and automation systems
* Predict and respond to utilization variances across multiple datacenters
* Identify and triage all outage-related events
* Facilitate communication, coordinate escalation, and work with subject matter experts to implement critical fixes
* Automate and streamline processes
* Track issues and run reports
Requirements
* 2-3+ years of Linux support/sysadmin experience in an Internet operations environment
* BA/BS in Computer Science or a related field, or equivalent experience
* Working knowledge of Linux, Cisco, TCP/IP, Apache and MySQL
* Experience working with network management systems and monitoring tools, such as Nagios, Ganglia and Cacti
* Competency in Shell, PHP, Perl or Python. C is a plus
* Solid understanding of web services architecture and commonly employed technologies
* A sense of urgency in responding to and resolving critical issues that relate to the performance of the site and/or core infrastructure
* Excellent verbal and written communication skills
* Participation in a shifted coverage schedule, including working nights and on-call rotations
Michelle Bostock | Facebook recruiting |
156 university avenue | palo alto, ca | 94301