Sponsored links

Go Back   MP3Car.com > Mp3Car Technical > Crowdsourced Data Collection, Open Source Mapping And Routing


Reply
 
Share Thread Tools Display Modes
Old 07-06-2009, 09:37 AM   #16
Variable Bitrate
 
thekl0wn's Avatar
 
Join Date: Apr 2005
Location: PoCo, Indiana
Posts: 247
thekl0wn is on a distinguished road
The sentences are standards... Therefore, I vote they be stored intact, line-by-line, in sequence on the server. At that point, any further information can be propagated from there. Surely, the server will have a mildy idle time where it can perform daily "builds" of the data.

A user uploads the formatted GPS output (like in the .txt file attached above) to the FTP. A parser application runs, which opens each new file, and inserts each sentence into a table, and then moves the text file into a save folder under a formatted name.

This is the most basic of the tasks, but it will also be an incredibly processor intensive one. All other data could easily (though, not fast) be propagated from a raw data table.
__________________
Planning [----X-----] 40%
Programming [-X-------] 20%
Parts [-----X----] 50%
Install [X--------] 5%

See Me In A Pink Skirt
thekl0wn is offline   Reply With Quote
Sponsored links
Advertisement
 
Advertisement
Old 07-06-2009, 09:47 AM   #17
darth sidious lite
 
Fiberoptic's Avatar
 
Join Date: Jul 1978
Location: Baltimore, MD
Posts: 1,181
Blog Entries: 128
Fiberoptic will become famous soon enoughFiberoptic will become famous soon enough
Quote: Originally Posted by Sicarius View Post
Another question to answer is how many points per unit time do you want? Most people are going to be producing 1 point per second but mine is going to be putting out 10 per second.

I think 10Hz GPS outputs might be very valuable. What hardware are you using to collect that stream?
__________________
Check out our video blog. 100 videos and counting
Fiberoptic is offline   Reply With Quote
Old 07-06-2009, 12:35 PM   #18
Constant Bitrate
 
Join Date: May 2008
Posts: 173
Sicarius is on a distinguished road
Quote: Originally Posted by Fiberoptic View Post
I think 10Hz GPS outputs might be very valuable. What hardware are you using to collect that stream?

The Wintec WBT-300 / G-Rays I.
Sicarius is offline   Reply With Quote
Old 07-06-2009, 02:42 PM   #19
Admin. Don't bug or I'll byte.
 
Bugbyte's Avatar
 
Join Date: Sep 2004
Location: Corning, NY
Posts: 6,143
Bugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to behold
@ FO and ecog - How about we do some experiments with NMEA data?

Why don't we compare a couple of datasets and compare the results of:

1. binary
2. zipped
3. raw

Pick some track data from a GPS that is 1 minute, 10 minutes, 20 minutes, 30 minutes and 1 hour long.

Figure the data sizes for all of the options and let's graph them to get an idea of how big the data storage requirements would be. Remember to take into consideration backups of the data both in time and in size/cost.

Then, we should see what kind of processing has to take place on the data to turn it into openstreetmap track information and run it against the datasets. You'll have to do this in the future to process the data and it would be a good idea to see if that would scale.

Then, we can calculate storage size and processing times for 1, 10, 100, 1000, and 10,000 users.

No sense starting it off one way and then finding out it won't scale.
__________________
Want to:
-Find out about the iBug?
-Stop being a newbie? Take a look at the FAQ Emporium?
-Find out about carPC's in just 5 minutes? View the Car PC 101 video
-Help me kill my car PC
-Watch live video streams from my mobile PC? Check it out here.
-Where is the iBug?
Bugbyte is offline   Reply With Quote
Old 07-08-2009, 12:03 PM   #20
darth sidious lite
 
Fiberoptic's Avatar
 
Join Date: Jul 1978
Location: Baltimore, MD
Posts: 1,181
Blog Entries: 128
Fiberoptic will become famous soon enoughFiberoptic will become famous soon enough
For now we have created this account with the following permissions:

-upload files to the server
-view all the uploaded files in that directory
-Can't delete files
-can't download files

IP:12.167.132.204
ftp port 21:
username: datadump
password: datadump


File Format:
TicketID_DDMMYYYY_HHMMSS.txt
Get your ticket id: http://www.mp3car.com/ticket.php
The forum username on the ticket.php page is optional. It will only be used down the road for credit purposes (who uploaded how many tracks, etc....)

More on data format:
-Files should contain NMEA sentences
-files should be in text format
-a collection of text files in zipped format will be OK
-Files without the proper format will be given the lowest processing priority

Last edited by Fiberoptic; 07-09-2009 at 09:53 AM.
Fiberoptic is offline   Reply With Quote
Old 07-08-2009, 12:12 PM   #21
Admin. Don't bug or I'll byte.
 
Bugbyte's Avatar
 
Join Date: Sep 2004
Location: Corning, NY
Posts: 6,143
Bugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to beholdBugbyte is a splendid one to behold
What's the address of the server?
__________________
Want to:
-Find out about the iBug?
-Stop being a newbie? Take a look at the FAQ Emporium?
-Find out about carPC's in just 5 minutes? View the Car PC 101 video
-Help me kill my car PC
-Watch live video streams from my mobile PC? Check it out here.
-Where is the iBug?
Bugbyte is offline   Reply With Quote
Old 07-08-2009, 12:21 PM   #22
North of the land of Hey Huns
 
malcom2073's Avatar
 
Join Date: Jun 2004
Location: Westminster, MD
Posts: 1,039
malcom2073 is a name known to allmalcom2073 is a name known to allmalcom2073 is a name known to allmalcom2073 is a name known to allmalcom2073 is a name known to allmalcom2073 is a name known to all
Why are you not allowing download of the uploaded data?
__________________
RevFE - Try it, you just might like it.
Carbon - Next Generation Touchscreen Browser
Come join us on IRC: irc.efnet.net #mp3car
Audiophiles make me chuckle as they pad my wallet.
malcom2073 is offline   Reply With Quote
Old 07-08-2009, 01:20 PM   #23
Mp3Car Staff
 
Heather's Avatar
 
Join Date: Sep 2007
Location: Fells Point, Baltimore, MD
Posts: 863
Blog Entries: 8
Heather is a jewel in the roughHeather is a jewel in the roughHeather is a jewel in the rough
There are three key issues in data collection:

1. Authenticity - is the data accurate, real, or is it false, which requires a unique identifier so that data that is false can be stripped out or weighted.

2. Accessibility - the data needs to be accessible to all, so that people can use the data to create better algorithms, better maps, utilize the map data in front ends, etc.

3. Anonymity - everyone needs to be able to upload data and be confident that their identity will not be public. Therefore, data will only be accessible publicly once we determine how we will hide, stripped out or encrypt, the unique identifier necessary for authenticity. Please help Goce determine the best method for this, as once this is solved, the data will be accessible.
__________________
Suggestions or Comments on the forums? Post here.

mp3Car store order questions or products that you would like to sell on the store? Email store @ mp3car.com

Feel free to pm me if you:
  • Have a general comment on mp3Car's products or services
  • Have a product you would like to have tested by the mp3Car community
  • Have a file you would like mp3Car to host
  • Have a cool idea that would improve the forums
Heather is offline   Reply With Quote
Sponsored links
Advertisement
 
Advertisement
Old 07-08-2009, 01:59 PM   #24
Variable Bitrate
 
thekl0wn's Avatar
 
Join Date: Apr 2005
Location: PoCo, Indiana
Posts: 247
thekl0wn is on a distinguished road
I guess the question I pose is this: Is the entire goal aimed at adding to the OSM project, or are we going to have our own database to develop from?
__________________
Planning [----X-----] 40%
Programming [-X-------] 20%
Parts [-----X----] 50%
Install [X--------] 5%

See Me In A Pink Skirt
thekl0wn is offline   Reply With Quote
Old 07-08-2009, 02:01 PM   #25
Mp3Car Staff
 
Heather's Avatar
 
Join Date: Sep 2007
Location: Fells Point, Baltimore, MD
Posts: 863
Blog Entries: 8
Heather is a jewel in the roughHeather is a jewel in the roughHeather is a jewel in the rough
Good question - no we will have our own database and we can give it to OSM, as well as others that want to make use of it. Also, all of the OSM data is available for download from their site.
__________________
Suggestions or Comments on the forums? Post here.

mp3Car store order questions or products that you would like to sell on the store? Email store @ mp3car.com

Feel free to pm me if you:
  • Have a general comment on mp3Car's products or services
  • Have a product you would like to have tested by the mp3Car community
  • Have a file you would like mp3Car to host
  • Have a cool idea that would improve the forums
Heather is offline   Reply With Quote
Old 07-08-2009, 02:03 PM   #26
darth sidious lite
 
Fiberoptic's Avatar
 
Join Date: Jul 1978
Location: Baltimore, MD
Posts: 1,181
Blog Entries: 128
Fiberoptic will become famous soon enoughFiberoptic will become famous soon enough
Quote: Originally Posted by thekl0wn View Post
I guess the question I pose is this: Is the entire goal aimed at adding to the OSM project, or are we going to have our own database to develop from?

At this point they are both goals because OSM doesn't support passive data collection, processing and the algorithms that might make this possible.
Fiberoptic is offline   Reply With Quote
Old 07-08-2009, 02:13 PM   #27
Variable Bitrate
 
thekl0wn's Avatar
 
Join Date: Apr 2005
Location: PoCo, Indiana
Posts: 247
thekl0wn is on a distinguished road
So, what form of database is this going to be? (I ask because my db experience is limited to MS-SQL & SQL-Anywhere) I'm just curious as to the level of participation I could offer... Other than just providing data.
__________________
Planning [----X-----] 40%
Programming [-X-------] 20%
Parts [-----X----] 50%
Install [X--------] 5%

See Me In A Pink Skirt
thekl0wn is offline   Reply With Quote
Old 07-08-2009, 02:47 PM   #28
Mp3Car Staff
 
ecog's Avatar
 
Join Date: Aug 2005
Posts: 88
Blog Entries: 9
ecog is on a distinguished road
Quote:
So, what form of database is this going to be?

I'm thinking a Mysql DB? It's open source and reliable, especially for data dumps.

If this project gets big enough for the database to be a problem, it will be a good problem to have and we can always port to something else.

For data collecting purposes though the DB will be fairly simple. Once the data processing starts we'll might need to get more elaborate and think about it some more.
ecog is offline   Reply With Quote
Old 07-08-2009, 03:41 PM   #29
Mp3Car Staff
 
ecog's Avatar
 
Join Date: Aug 2005
Posts: 88
Blog Entries: 9
ecog is on a distinguished road
Quote: Originally Posted by Bugbyte View Post
@ FO and ecog - How about we do some experiments with NMEA data?

Why don't we compare a couple of datasets and compare the results of:

1. binary
2. zipped
3. raw

Pick some track data from a GPS that is 1 minute, 10 minutes, 20 minutes, 30 minutes and 1 hour long.

Figure the data sizes for all of the options and let's graph them to get an idea of how big the data storage requirements would be. Remember to take into consideration backups of the data both in time and in size/cost.

Then, we should see what kind of processing has to take place on the data to turn it into openstreetmap track information and run it against the datasets. You'll have to do this in the future to process the data and it would be a good idea to see if that would scale.

Then, we can calculate storage size and processing times for 1, 10, 100, 1000, and 10,000 users.

No sense starting it off one way and then finding out it won't scale.

Just some stats from a GPS NMEA stream:

-text file is encoded in ASCII format
-Track length 18.3mi
-Trip time is about 30 min
-Number of NMEA sentences with a 0.5s sample rate from a BU-353 is 7651
-Size of the txt file is 460KB
-Compressed file size is 82KB


more in a little bit.....
ecog is offline   Reply With Quote
Old 07-09-2009, 09:54 AM   #30
darth sidious lite
 
Fiberoptic's Avatar
 
Join Date: Jul 1978
Location: Baltimore, MD
Posts: 1,181
Blog Entries: 128
Fiberoptic will become famous soon enoughFiberoptic will become famous soon enough
Clarified temporary upload details in previous posts. and here.
Fiberoptic is offline   Reply With Quote
Sponsored links
Advertisement
 
Advertisement
Reply

Bookmarks

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
iBusRR v2.0.5: 06/01/2008 BMW Doors/Windows/Lights control NEW RELEASE!! WaRFiElD RR Released Plugins 384 10-14-2009 11:44 AM
Interview Mathieu Monney, creator of XGPS iPhone navigation application Fiberoptic Mp3Car Blog Talk 7 04-06-2009 05:14 PM
Can you guys please help me??? OBD Renault Clio help needed! madtoonbull Engine Management, OBD-II, Engine Diagnostics, etc. 7 02-19-2009 11:36 AM
Pinout Color Codes / Tables gummybear General Hardware Discussion 4 05-12-2005 04:05 AM
VOICES Traffic Data stevieg V.O.I.C.E.S 85 05-31-2004 09:24 PM



All times are GMT -5. The time now is 07:20 AM.


Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.3.2
Copyright © 1999 - 2008 Mp3Car.com Inc.Ad Management by RedTyger
Message Board Statistics