Thu, Feb 13 2020 at 06:00 PM at Lumere
By: Sam Mahisekar
Experience Level: Novice
If you have attended a few ChiPy events, chances are you have used the chipy.org website. The ChiPy Web Guild is a group of volunteers that help maintain the site. In this talk, I will give a brief description of how the Web Guild works and touch on some aspects of the ChiPy.org site. We will then go through an example of how team members were able to address a flaw in the ChiPy.org code enhancing user experience. Finally, I will share some thoughts on what I learned and what the group might work on next.
By: Nick Petrovits
Experience Level: Novice
Join us as we describe our migration from a limiting cloud deployment on long-running VMs with shared infrastructure to a streamlined immutable infrastructure built on top of Docker and K8s. We'll also discuss techniques to support local development during this transition. Many teams wish they could reap the widely known benefits of Kubernetes (K8s), but most struggle to migrate to a new infrastructure while simultaneously supporting two deployment models and avoiding impacts to the velocity of software development. In this talk, we describe the particular challenges we faced during our incremental migration from multiple long-running singleton EC2 instances to a containerized solution. We'll highlight: - What challenges motivated us to transition to K8s? - Approaching an infrastructure migration incrementally to minimize impacts to local development and production deployments - Developing a solution to provide the same abstraction for local development that exists in production - Concurrently supporting multiple deployment models to reduce risk and simplify migration - Strategy variations for synchronous and asynchronous services - Networking challenges with Vagrant and Docker - Integrating K8s with a CI/CD pipeline - Tuning the environment
(15 Minutes)
By: Michael McCarthy
Experience Level: Novice
The rapid growth of Python is, in part due, to it's exceptional toolkit for Data Analysts, Scientists, and Engineers. Packages like Pandas, Scikit-Learn, PySpark, and Dask have become staples for teams looking to process data. However, when processing large amounts of data there are times when Python might not be the right solution for your task. In this conversation, we'll learn about Cloud based Data Warehouses, such as Google's BigQuery, Amazon's Redshift, and Snowflake. You'll learn about the advantages of these platforms compared to in-memory processing in Python. We'll also show examples of how you can use Apache Airflow to automate recurring tasks, turning your Data Warehouse into the cornerstone of your Data Science infrastructure.
Thu, Dec 12 2019 at 06:00 PM at Braintree
(180 Minutes)
By:
Ten mentees will present the projects that they have been working on with their mentoors for the past 3 months.
Tue, Nov 19 2019 at 06:00 PM at Metis
Thu, Nov 14 2019 at 06:00 PM at Nielsen
(20 Minutes)
By: Ray Buhr
Experience Level: Novice
I made a package, pyplot-themes, that helps make it easier to: 1. have decent looking matplotlib/pandas plots 2. have some decent color palettes 3. create your plot themes https://pypi.org/project/pyplot-themes/
(30 Minutes)
By: Nikola Novakovic
Experience Level: Intermediate
Machine Learning is something you'll see referenced very frequently now in everything from marketing materials to sales pitches, and job postings. With so much hype it can be hard to distinguish what people mean when they say Machine Learning. In this talk we will demystify Machine Learning by understanding its core concepts and applying that knowledge to real world examples. We'll explain basic concepts like linear algebra and loss functions, figure out when to use machine learning and build an ML model that we'll be able to use in real world apps. Here’s an in-depth list of what we'll cover: * What Machine Learning is and where it’s being used * How to recognize when machine learning is necessary * Math 101 * Linear Regression * Live Coding Session Salary Estimator * Q & A
(30 Minutes)
By: Jordan Bettis, Meygha Bhat, Vamsi Guntamukkala
We will talk to you about Nielsen's Connect Platform, our global, unified, open data ecosystem powered by Microsoft Azure and how we're building platform components using Python. Specifically, we'll deep dive into object-oriented data flows. As more and more data scientists write software beyond statistical models, object thinking from the field of programming can help them write test-able, maintainable and reusable components.