-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fields to be imported? #3
Comments
We haven't gotten that far yet! Would love some help figuring that out. |
The building data looks like this: Official documentation for the building outlines: http://egis3.lacounty.gov/dataportal/2011/04/28/countywide-building-outlines/ Of those attributes, I propose we keep the height, and discard the rest. Source and date (LARIAC, 2008) will be added to the changesets. The address data looks like this: Official documentation for the address points is here: http://egis3.lacounty.gov/dataportal/2012/06/19/la-county-address-points I'm not an expert about OSM's addressing scheme, so any input on this part would be useful. We will need to make extensive changes to the |
Amazing timing, I just remembered about and showed this issue to @meetar this afternoon. We'll take a look and chime in with any thoughts on buildings, and @missinglink or @sevko might have some input on addressing from their work on Pelias. |
@almccon , the dataset should map nicely onto the
The LA dataset looks pretty granular, so you might want to investigate the more esoteric tags ( |
According to the wiki, the Makes sense to ignore height < 1 and just leave off the height tag for those. |
Regarding building fields, it may be worth preserving an id field ( I also suggest preserving the elevation in an |
Regarding the building id, there's both BLD_ID and AIN. I'm not sure which one is more stable, or what they're used for. I know that the NYC import was a special case in that the city had a clear plan about how they would update their own data following changes in OSM. If LA County doesn't have such a plan, then building IDs may not be as useful. Generally there seems to be a consensus forming in OSM that IDs are not very useful in imported data. But we can finalize that decision after we've consulted with the imports committee. For now, we should find out whether BLD_ID and/or AIN will be the same in the 2014 data. I'm tracking that possibility in issue #6. Can someone who knows more about the 2014 data (@cityhubla?) find out if which IDs will be the same? |
The AIN should be the same, these are the legal numbers of the parcels and anything pertaining thereof, which includes the buildings. These numbers change rarely if only the owner of the property decides to the split their property into pieces (subdivide). So the data should remain the same. The parcel data, contains attributes to the type of building it is, (single story, mixed use, commercial, retail, church, school, theater, parking lots, , government buildings, etc) to which we could add. I've emailed UCLA and County for their data use disclaimer regarding issue #2 I'll shoot an email to the County regarding their plan, if the BLD_ID is kept consistent with the not yet released 2014 data. |
Okay, check cdd4342 for my first pass at the conversion code. I know for sure I'm not catching all of the fields that I should be. I'd love someone else to take a look at it. I'm happy with how the address points are joining with the buildings (where possible). If you use my chunks_venice.zip sample files, and run |
Regarding the Unique building IDs, the NYC import has them included, it may be best to add them as well. I could contact LA county to see if the 2014 LARIAC bldg outlines (non-public) are completely new IDs and different from the 2008 public set we're using. I'm anticipating the new set to have outlines with recently constructed buildings replacing existing ones, we could then automate this in the future by removing those IDs that are no longer there with the new ones. I'm also wondering if there is a way to retag the removed outlines for archiving. |
Also, the data behind the elevations are off at a number of buildings, this 3D demo I made of Hollywood ThreejsQGIS demo shows the discrepancy. You will see some buildings elevated way off from a normally sloping area. At the office I work at, we use the outlines to build our 3D contextual site models for conceptual architectural work. I usually have to recenter the object axes to then intersect with the terrain in software like Rhino3d or Sketchup. |
It may take quite a bit of time to average them out, if we script something where it takes a feature and determines whether it's ELEV is within the average of the closest surrounding buildings. |
We may have some luck with the incoming LA County Open Data Initiative it seems the county assessor is actually going to release the roll data for each parcel for free, (more up to date than what I found from UCLA). Would this help in the code development? One attribute that would be very beneficial is the "building type" the roll data identifies the building as either a single family home, commercial, mixed, civic, school, church, etc. The text says that the data portal would be accessible on or after March 30th. |
Oh, that does look quite interesting! I'd love to be able to use http://wiki.openstreetmap.org/wiki/Tag:building%3Dhouse In fact, many of those, like churches, schools, hospitals, could also be captured. And @lyzidiamond and @joeyklee would love |
http://wiki.openstreetmap.org/wiki/Tag:building%3Dgarage would also be great for all those garages behind single-family homes |
Just had a thought: does the assessor data contain information about the number of floors in the building? If so, we can populate the building:levels tag as well as the height which comes from the building footprint data. |
For the case of LA, I don't think it is included (http://assessor.lacounty.gov/local-roll/), but I do think the "building:levels " tag would be a worth adding since it could be something derived via geotagged photos or google streetview imagery :) In cases like the Chicago building footprint data, they include building:levels for some of their buildings. |
Bummer. If it's not in the assessor data then we can't include it in the import. But there's nothing stopping people from adding it later based on ground-level imagery or surveying on-the-ground. We can't use Google Street View for OSM mapping because of the license, but we can use Mapillary where available: http://www.mapillary.com/map/im/2FuBwfL320GgjY5amgiCgw/photo |
That is no bueno. And definitely good point about the licensing. Always something to keep in mind ;) |
@jschleuss and I made this google spreadsheet to figure out the fields and their OSM tag Attributes to be imported, Google Spreadsheets We can use this spreadsheet to determine which values go or omitted. There are two distinct uses that the 2014 Assessor's data (also referred to issue #15) has:
Jon and I propose that the these tags could be attached during the import.
For example if a building is detached single family residence its tag is as follows
OSM's taginfo shows residential as a general tag, we could consider the building tag as general. We think there should be a general tag for general designations like residential, commercial, industrial, with a subtag like building:use as whether a residential is either single family, duplex, triplex condominum. The data in the assessor's is fine grain, we're open to tagging them differently. Thoughts? |
The SpecificUseDetail2 of the 2014 Assesor's data has these values
We could update the script to add these to the building:levels tag |
@cityhubla Nice find! I'm sorry I missed that. It's been awhile since I revisited the data. Better two brains than one! |
The only issue is that this category has values like Modular, pool, vacant land, fast food etc. It would have to be sorted during scripting. Those identified as 14-20 or 6-13 would need to be omitted as the tag is number based, right? |
Here is a breakdown of all the unique values in the use categories, Google Spreadsheet |
I'm figuring out the markdown syntax for adding tables to the readme, We could transition the gdoc to the readme or create a git_wiki to simplify what attributes we're importing |
This is great! I think we can add these tables to the README in Markdown, or else put them in the wiki on osm.org. Eventually we'll have to document everything on OSM's wiki anyway. But we shouldn't use the github wiki, since that will just confuse things. |
@cityhubla okay. I added the 2015 Assessor data to that spreadsheet. I also started thinking about the OSM tags, starting with our thoughts above. I think we could work with OSM's tags a bit with building=house or building=school. And then go generic when we don't have more information. We could augment the script to look for both values before making a "decision" on the tags to add. Maybe we do something similar for SpecificUseType1 and 2? |
@almccon you know the assessor's data also has address fields, right? But we're opting for the address points because some buildings will have multiple addresses? Is that right? |
No, I did not know that the assessor's data has addresses. But you're correct, if the assessor only has one address per parcel then the address points will be preferable. |
Table has been added to the readme, I'll update the osm wiki. From the looks of it, and what Jon and I talked last Sunday, our import will be really comprehensive. Lots of great data. |
If anyone has time, the assessor's data has values in the USE columns that could be sorted into specific OSM tags, like the |
Just went through the discussions and it looks like we can categorize the fields into two parts:
Per this trial #18 (comment) I did notice that around 1 in 20 buildings had changed and were not good to be imported. If we tie in the address fields to the building footprints while importing, we will have to discard both the footprints and address together, but ideally we would want to import the address as a point property or add it to a newer footprint if possible. How about we split the dataset into polygons with just the footprint and building attributes, and a point dataset of address attributes extracted from building centroids? This will allow us to better fill data gaps in OSM rather than trying to throw all the data in at once:
|
I took a stab at reviewing the Assessor categories. I think it contains a lot of information that can fine tune the What I did was first compare the In most cases, we will override the Tags we can include are Pros - we adopt common OSM convention on tagging buildings. We transalte as much info as possible from the source. Next Actions
|
PR for review here. #28 |
@maning in #3 (comment) what are the two categories*? |
@talllguy, the yellow = residential, purple = industrial. The area northwest is an industrial complex. |
@maning interesting. I suspect a zoning boundary caused this. Having worked for a County gov't GIS team, I wouldn't be surprised if such a method was used to tag building uses by an assessor. You might try loading the zoning boundary if you can find it on LA county's open data site to compare. My prediction is that the zoning line will transect the building right where the split is, because zoning lines were historically mapped on small scale maps by hand, long before GIS was around. Then when GIS came around, GIS analysts were forced to digitize the zoning lines exactly where they were on the map, because changing them at all requires a law change. I digress. Check that zoning line and then if it is the culprit, you might just want to us an algorithm to decide to merge the larger piece, or ignore these zoning inferred tagging. Also, regarding institutional, my county used that as well. In my import, I believe I changed them all to building=yes, because there was no positive way to say what they translated to. Typically they're school, college or government though. |
You are correct we used LA County's Assessor data by joining the building shapefile and assessor csv using
You are correct again, |
Closing. |
Hi, could you let us know which of the fields in the original dataset you plan to import, and what the corresponding proposed OSM tags will be? Apologies if I missed this somewhere!
The text was updated successfully, but these errors were encountered: