Sorry, this job is no longer accepting applications. See below for more jobs that match what you’re looking for!

Senior Principal Data Scientist

Expired Job

Comcast Philadelphia , PA 19107

Posted 3 months ago

Comcast's Technology & Product organization works at the intersection of media and technology. Our innovative teams are continually developing and delivering products that transform the customer experience. From creating apps like TVGo to new features such as the Talking Guide on the X1 platform, we work every day to make a positive impact through innovation in the pursuit of building amazing products that are enjoyable, easy to use and accessible across all platforms. The team also develops and supports our evolving network architecture, including next-generation consumer systems and technologies, infrastructure and engineering, network integration and management tools, and technical standards.

Bring a combination of mathematical rigor and innovative algorithm design to create recipes that extract relevant insights from billions of rows of data to meaningfully improve Comcast user experience.

Are you passionate about network, digital media, entertainment, and software services? Do you like big challenges and working within a highly-motivated team environment? As a Data Scientist, you will research, develop, support and apply new techniques using real-time distributed computing architectures. You will employ your skills to deliver insights into customer and network behavior to drive business decisions that will shape the future of Comcast and be part of a team that thrives on big challenges, results, quality, and agility.

Who does the Data Scientist work with?

The Data Engineering and Science team as part of our Next Generation Access Network is a diverse collection of professionals who work with a variety of organizations ranging from software engineering teams whose software integrates with analytics services, network architect and engineers, to service delivery engineers who provide support for our product, testers, operational stakeholders with all manner of information needs, and executives who rely on data for data based decision making.

What are some interesting problems you'll be working on?

Develop models capable of processing millions of events per second and multi-billions of events per day, providing both a real time and historical view into the operation of our products and services. Work on high performance real time data stores and a massive historical data sets using best-of-breed and industry leading technology. Work closely with various engineering teams to solve key optimization, insight and access network data challenges.

Where can you make an impact?

The Comcast Next Generation Access Network Data Engineering and Science team is acquiring, studying, simulating, and modeling to enable data as a key driver and core functional component toward better understanding, predicting, and dynamically optimizing the core network to improve overall user experience. Success in this role is best enabled by a broad mix of skills and interests ranging from traditional distributed systems software engineering prowess to the multidisciplinary field of data science.


  • Building a strong intuitive understanding of the problem domain (Next Generation Access Networks). Identify testable hypotheses to explain interesting phenomena in the domain.

  • Selecting and transforming features, building and optimizing classifiers using machine learning techniques

  • Integrating data from multiple sources including third party sources.

  • Data mining using state-of-the-art methods

  • Enhancing data collection procedures to include information that is relevant for building analytic systems

  • Frequent meeting/communication with clients to interpret their needs, plan/organize, and discuss progress and results

  • Developing actionable quantitative models in the areas of effectiveness, ROI, pricing and optimization.

  • Doing ad-hoc analysis and presenting results in a clear manner

  • Creating automated anomaly detection systems and constant tracking of its performance

  • Creating automated evaluation environment of complex models and constant tracking of relevant performance

  • Develop and communicate goals, strategies, tactics, project plans, timelines, and key performance metrics to reach goals

  • Review, direct, guide, inspire the analytical work of more junior staff

Here are some of the specific technologies we use:

  • Spark (AWS EMR), AWS Lambda

  • Spark Streaming and Batch

  • Avro, Parquet

  • Kafka

  • MemSQL, Cassandra, HBase, MongoDB, RDBMS

  • Caching Frameworks(ElasticCache)

  • Elasticsearch, Beats, Logstash, Kibana

  • Java, Scala, Go, Python, R

  • Git, Maven, Gradle, Jenkins

  • Rancher, Puppet, Docker, Ansible, Kubernetes

  • Linux

  • Hadoop (HDFS, YARN, ZooKeeper, Hive), Presto

  • Keras, TensorFlow

Skills & Requirements:

  • Graduate degree or PHD in the following areas: Statistics, Data Science, Computer Science or Operations Research.

  • 6+ years working within an enterprise data lake/warehouse environment or big data architecture

  • Excellent understanding of machine learning techniques and algorithms, especially in the deep learning area -- both theoretical underpinnings and craft (Systems such as Tensorflow, Theano, Caffe, scikit.learn and their APIs).

  • Excellent applied statistics skills and understanding of probability distributions, statistical testing, regression, etc.

  • Experience with common data science toolkits, such as scikit-learn, R, etc. Excellence in at least one of these is highly desirable.

  • Great communication skills.

  • Experience with data visualization tools, such as D3.js, GGplot, Matplotlib, etc.

  • Proficiency in using query languages such as SQL and Hive.

  • Experience with NoSQL databases, such as Redis/ElasticCache, Cassandra, HBase

  • Good scripting and programming skills, such as Java, Scala, R, Python, or Spark

  • Data-oriented personality

Comcast is an EOE/Veterans/Disabled/LGBT employer

See if you are a match!

See how well your resume matches up to this job - upload your resume now.

Find your dream job anywhere
with the LiveCareer app.
Download the
LiveCareer app and find
your dream job anywhere

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Software Engineer Data Science

Near-Miss Management

Posted Yesterday

VIEW JOBS 12/12/2018 12:00:00 AM 2019-03-12T00:00 <p>Near-Miss Management (NMM) is seeking an experienced and highly motivated software engineer to further develop and scale its unique browser-based data analytics platform. NMM provides an analytics and visualization software solution to multiple corporate clients in the chemical and refining industry. Within our small engineering team, your voice will be a valuable asset to product development.</p><p>In this role, you will be responsible for further development of NMM’s successful flagship analytics product by adding new model capabilities – some of which have already been identified. As a senior engineer, you will be in a unique position to define and undertake technical needs of the software across its entire stack. In addition to addressing business needs and feature development, you will also be able to autonomously help shape the future of the software. Enjoy flexible hours, work-from-home days, and a relaxed casual atmosphere as you work with experienced team members to help design, build, and deliver a robust, reliable software system.</p><p><strong>Requirements</strong></p><p> One of the main responsibilities of this role is the development and maintenance of an existing data analytics model, which is implemented in Python using the Numpy/Pandas stack. This role will be responsible for supporting the existing machine-learning based analytics while providing key insights for their extension and improvement. Aside from analytics, an important aspect of the role involves the processing pipeline. Also written using Python, the pipeline executes the analytics in a parallelized fashion, and assistance with moving towards a distributed processing paradigm would be valuable. </p><p><br></p><p><strong>Required Qualifications</strong></p><ul> <li>MS or PhD in Computer Science/Engineering, or related fields</li> <li>Extensive Numpy and Pandas experience</li> <li>Model validation and performance testing</li> <li>Take analytics models from prototype stage to deployment with scale and performance in mind</li> </ul><p><br></p><p><strong>Desired Qualifications</strong></p> <ul><li>1+ year of experience in data science and data engineering</li></ul> <ul> <li>Ability to comfortably manage version control using Git </li> <li>Experience with distributed systems design and architecture</li> <li>Jupyter Notebook prototyping </li> <li>General understanding of both NoSQL and relational databases and the ability to programmatically interface with them</li> <li>Experience with unit testing, integration testing, automated testing strategies, and continuous integration</li> <li>Understanding of object-oriented design principles and paradigms </li> <li>Experience with NodeJS, Javascript, TypeScript</li> </ul> Near-Miss Management Philadelphia PA

Senior Principal Data Scientist

Expired Job