NYC Planning’s Data Engineering team is transforming the way we think about Open Data in government by opening up the processes behind making public datasets. As data engineers we develop new data products and modernize the creation of existing datasets, such as PLUTO, which is NYC’s definitive tax lot dataset that contains over 800k rows and 87 columns capturing lot level, building level, and geospatial attributes sourced from a dozen input data sources. During this talk we’ll show how we re-engineered PLUTO and made sure that the data matched previous versions, discuss why it is important to us that the code to build PLUTO is available on GitHub, and describe where we’re going next, as an example of the type of work that we do.
Though, we’re not just excited about the data products we build, we’re equally passionate about how we build them. In the later portion of the talk we’ll do a deep dive into a couple of the core technologies we use that enable us to iteratively integrate improvements, distribute our maintenance responsibilities, and generate products efficiently.
NYC Private Housing Maintenance Problems - a Data Dashboard app
As an NYC tenant myself, I wanted to better understand the overall conditions of NYC private buildings, namely what type of maintenance problems people have to deal with.
Through the NYC Open Data portal, I accessed the main database of this study: 311 HPD Complaints (HPD, Dep. of Housing Preservation and Development is the agency that regulates housing conditions), where it is shown a list of tenant complaints about poor housing conditions on private multi-dwelling buildings. It includes problems like heating, electric, heat/hot water, unsanitary conditions, etc. Using this data, I applied statistics, GIS and R coding magic and I ended up learning insights about:
1. The proportion and type of maintenance problems that private tenants face, currently and historically, as well as emerging and ongoing problems.
2. Where are the complaint hotspots, what neighborhoods are the worst vs. best? And who is most affected by the overall NYC's bad housing conditions?
Check out the study and the data dashboard here: NYC Housing Problems NYC