[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Job Opportunities at Facebook

Facebook is hiring!  We are looking for a Systems Engineer/Architect and Site Reliability Engineer.  I have attached the job descriptions below.  If you are interested, please contact Michelle Bostock mbostock-at-facebook.com.  Thanks and Happy Holidays!

Systems Architect
Palo Alto, CA

Facebook is seeking a seasoned Systems Architect to join the Operations team. The position is full-time and is based in our main office in downtown Palo Alto and will report to the Manager of Systems Operations.


    * Analyze application flow and infrastructure design to improve performance and scalability of the site
    * Collaborate on design of services infrastructure from servers to networking
    * Monitor, analyze, and make recommendations as appropriate to improve site stability and availability
    * Evaluate hardware and software technologies to improve site efficiency and performance
    * Troubleshoot and solve issues with hardware, applications, and network components
    * Lead team efforts from design to implementation, prioritize tasks and resources while interacting with Engineering and Operations
    * Document current and future configuration processes and policies
    * Participate in 24x7 on-call support


    * B.S. in Computer Science or equivalent experience
    * 4+ years of experience in Operations with large web farms
    * Extensive knowledge of web architecture and technologies, including Linux, Apache, MySQL, PHP, TCP/IP, security, HTTP, LDAP and MTAs
    * Strong background/interest in application and infrastructure design
    * Scripting and programming skills
    * Excellent verbal and written communication skills
Site Reliability Engineer
Palo Alto, CA

Facebook is seeking talented operations engineers to join the Site Reliability Engineering team. The ideal candidate will have strong communication skills, a passion for tinkering with Linux, and an almost insane fondness for fast-paced, seat-of-your-pants troubleshooting and crisis management. The position is full-time and is based in our main office in downtown Palo Alto. This position reports to the Manager of Site Reliability Engineering.


    * Monitor the stability and performance of the website
    * Remotely troubleshoot and diagnose hardware problems
    * Debug issues with Linux software, applications and network
    * Resolve technical challenges encountered in LAMP technologies
    * Develop and maintain monitoring tools and automation systems
    * Predict and respond to utilization variances across multiple datacenters
    * Identify and triage all outage related events
    * Facilitate communication, coordinate escalation, and work with subject matter experts to implement critical fixes
    * Automate and streamline processes
    * Track issues and run reports


    * 2-3 years+ Linux support/sys admin experience in an Internet operations environment
    * BA/BS in Computer Science or a related field, or equivalent experience
    * Working knowledge of Linux, Cisco, TCP/IP, Apache and mySQL
    * Experience working with network management systems and monitoring tools, such as Nagios, Ganglia and Cacti
    * Competency in Shell, PHP, Perl or Python. C is a plus
    * Solid understanding of web services architecture and commonly employed technologies
    * A sense of urgency in responding to and resolving critical issues that relate to the performance of the site and/or core infrastructure
    * Excellent verbal and written communication skills
    * Participation in a shifted coverage schedule, including working nights and on-call rotations

Michelle Bostock | Facebook recruiting |
156 university avenue | palo alto, ca | 94301

Reply to: