Loading…
This event has ended. Visit the official site or create your own event on Sched.
Welcome to NYC School of Data — a community conference that demystifies the policies and practices around civic data, technology, and service design. This year’s conference concludes NYC’s annual Open Data Week & features 60+ sessions organized by NYC’s civic technology, data, and design community! Our conversations & workshops will feed your mind and empower you to improve your neighborhood. Follow the conversation #nycSOdata on twitter and tune into our live stream (provided by the Internet Society New York Chapter).

To attend, you need to purchase tickets via eventbrite. Venue is fully accessible and content is all ages friendly — free, professional on-site childcare is provided for ALL participants! If you have accessibility questions or needs, please email us at < schoolofdata@beta.nyc >.

View Sessions by detail - room - grid. If you have any questions, please see our welcome to 2020 post or FAQ.
Saturday, March 7 • 11:30am - 12:30pm
Developing in the open & rebuilding PLUTO

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.

NYC Planning’s Data Engineering team is transforming the way we think about Open Data in government by opening up the processes behind making public datasets.  As data engineers we develop new data products and modernize the creation of existing datasets, such as PLUTO, which is NYC’s definitive tax lot dataset that contains over 800k rows and 87 columns capturing lot level, building level, and geospatial attributes sourced from a dozen input data sources.  During this talk we’ll show how we re-engineered PLUTO and made sure that the data matched previous versions, discuss why it is important to us that the code to build PLUTO is available on GitHub, and describe where we’re going next, as an example of the type of work that we do.

Though, we’re not just excited about the data products we build, we’re equally passionate about how we build them.  In the later portion of the talk we’ll do a deep dive into a couple of the core technologies we use that enable us to iteratively integrate improvements, distribute our maintenance responsibilities, and generate products efficiently.


Speakers
avatar for Amanda Doyle

Amanda Doyle

Team lead, data engineering, NYC Planning
PLUTO and open datasets created by NYC Planning.
BC

Baiyue Cao

NYC Planning
BZ

Baoling Zhou

NYC Planning



Saturday March 7, 2020 11:30am - 12:30pm
3-302 - (60 ppl)