Name: Developing in the open & rebuilding PLUTO
Start: 2020-03-07T11:30:00-0500
End: 2020-03-07T12:30:00-0500

Welcome to NYC School of Data — a community conference that demystifies the policies and practices around civic data, technology, and service design. This year’s conference concludes NYC’s annual Open Data Week & features 60+ sessions organized by NYC’s civic technology, data, and design community! Our conversations & workshops will feed your mind and empower you to improve your neighborhood. Follow the conversation #nycSOdata on twitter and tune into our live stream (provided by the Internet Society New York Chapter).

To attend, you need to purchase tickets via eventbrite. Venue is fully accessible and content is all ages friendly — free, professional on-site childcare is provided for ALL participants! If you have accessibility questions or needs, please email us at < schoolofdata@beta.nyc >.

View Sessions by detail - room - grid. If you have any questions, please see our welcome to 2020 post or FAQ.

Back To Schedule

Developing in the open & rebuilding PLUTO

Feedback form is now closed.

NYC Planning’s Data Engineering team is transforming the way we think about Open Data in government by opening up the processes behind making public datasets. As data engineers we develop new data products and modernize the creation of existing datasets, such as PLUTO, which is NYC’s definitive tax lot dataset that contains over 800k rows and 87 columns capturing lot level, building level, and geospatial attributes sourced from a dozen input data sources. During this talk we’ll show how we re-engineered PLUTO and made sure that the data matched previous versions, discuss why it is important to us that the code to build PLUTO is available on GitHub, and describe where we’re going next, as an example of the type of work that we do.

Though, we’re not just excited about the data products we build, we’re equally passionate about how we build them. In the later portion of the talk we’ll do a deep dive into a couple of the core technologies we use that enable us to iteratively integrate improvements, distribute our maintenance responsibilities, and generate products efficiently.

Speakers

Workshop, presentation and discussion

NYC School of Data 2020

Amanda Doyle

Baiyue Cao

Baoling Zhou

Molly Graber

NYC School of Data 2020

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Amanda Doyle

Baiyue Cao

Baoling Zhou

Molly Graber