Sponsored links

Go Back   MP3Car.com > Mp3Car Technical > Linux


Reply
 
Share Thread Tools Display Modes
Old 08-04-2007, 06:14 AM   #1
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
computer vision stuff

Using the Intel's library, now face position recognition.
Now i'm searching to build model for first, recognize one face and only one, second build model of lips while speaking and make it able to recognize phonemes and by the way read lips. This plus voice recognition will kill the weak point of simple voice recognition.

So i will use the same tools or inspired from the Sphinx project for models of pronouciation, i will record some news on tv while people are speaking and try to get some vectors models from those records. In the end maybe it will help me to build a lips reading model.












Last edited by kraft; 08-08-2007 at 04:21 AM.
kraft is offline   Reply With Quote
Advertisement
 
Advertisement
Sponsored links

Old 08-04-2007, 07:25 AM   #2
One Sharp Cheddar
 
ThunderStick's Avatar
 
Join Date: Aug 2006
Location: West Allis, WI
Posts: 1,558
ThunderStick is on a distinguished road
Good luck with that
ThunderStick is offline   Reply With Quote
Old 08-04-2007, 07:49 AM   #3
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
It won't hurt trying
i found model of vectors for letters recognition using this library, so a particular mouth shape in a moment or a letter...still vectors
kraft is offline   Reply With Quote
Old 08-04-2007, 09:37 AM   #4
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
Video of the shape/face recognition

http://www.dailymotion.com/video/x2o...de-visage_auto

http://www.dailymotion.com/video/x2o...cond-test_tech

Last edited by kraft; 08-04-2007 at 10:10 AM.
kraft is offline   Reply With Quote
Old 08-04-2007, 12:09 PM   #5
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
follow my eyes

http://www.dailymotion.com/video/x2o...w-my-eyes_tech

now read my lips...well not yet but with this sample code, i'll attempt to build a tool to first :
fill one array with points, second define a vector based on those points, in the end with a gentle and patient model bulding should give something that looks like a result

http://www.dailymotion.com/video/x2o...d-my-lips_tech
kraft is offline   Reply With Quote
Old 08-04-2007, 01:47 PM   #6
Constant Bitrate
 
Join Date: Mar 2006
Posts: 145
ReflexNPG is on a distinguished road
Nice idea!

What do you think about gesture recognition? Or about a "virtual console"? For example just slide the finger over the dash to change volume or seek.

Good luck with your project!
ReflexNPG is offline   Reply With Quote
Old 08-04-2007, 02:01 PM   #7
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
I thought about this but with a follow my eyes function but actually following an eye direction is much more difficult as it request a very sensitive camera or a high lighted environment, so a car is not a so good place for so tiny details....a finger to "touch" in the air some functions displayed on screen could be an elegant solution and a far "easier" to do

In addition of the follow me function, some morpho functions that could identify a particular face (opencv have them) could be amusing and a pretty good anti-theft system
kraft is offline   Reply With Quote
Old 08-07-2007, 08:47 AM   #8
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
Here is an attempt to make the thing follow my eyes and also detect in which direction i look.

To understand what is happening in the video, the circles appearing are here to materialize a region of interest the code detected. Inside of the circle you will see a segment starting from center and showing the heading of the move that happens.

After filtering i'm pretty sure that i can get it simplified and be usable as a mouse cursor or better as a virtual touch screen, still have work and probably it won't work but it's fun

Please note that the purpose of this video is not to be nice to watch, if i can do what i plan to do, the video shown actually won't even be visible. Actually it is just to show a feedback of what is happening inside the computer.

http://www.dailymotion.com/video/x2p...h-my-eyes_auto

http://www.dailymotion.com/video/x2p...ouvements_tech

Last edited by kraft; 08-07-2007 at 12:42 PM. Reason: added a link to another video
kraft is offline   Reply With Quote
Sponsored links
Advertisement
 
Advertisement
Old 08-07-2007, 04:35 PM   #9
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
This time a far better code very fluid with the real video layer, no delayed timings, looks like i was playing with some alive stuff or manipulating mercury but no directions detection, it is more suitable for gestures. The tracking abilities of this code and the direction detection of the other would probably make something really usable

http://www.dailymotion.com/video/x2p...e-video_webcam
kraft is offline   Reply With Quote
Old 08-07-2007, 06:19 PM   #10
Constant Bitrate
 
Join Date: Mar 2006
Posts: 145
ReflexNPG is on a distinguished road
your experiments are very interesting and i think an iphone like virtual interface is possible...

I've stopped my research, cause its very hard to use it in the car. There are so different light situations (even no light, when you drive @ night).
So it needs at least an infrared ilumination and when the sun is near the horizon - thats a great challenge

I have another experiment 4 you: I am trying to digitize the power line. All connected parts leave their traces in form of signals there. A neural network processes the signals and can find out if somethings "wrong". This could be an early detection of a possible malfunction. With enough data it would even be possible to identify the bad component. Second step would be the same with a microphone in the engine bay.

The system had 2 learn normal events like turning on lights or shifting gear etc
ReflexNPG is offline   Reply With Quote
Old 08-07-2007, 08:43 PM   #11
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
They are a lot of filtering options and ir camera with an ir projector can be used
Before i start something else i will try to go forward with this cv library, for lights etc it's already done, look inside this thread, there are some interesting pics with the motorhome

http://www.mp3car.com/vbulletin/off-...-attempts.html
kraft is offline   Reply With Quote
Old 08-08-2007, 03:38 AM   #12
Low Bitrate
 
Join Date: Jan 2007
Posts: 75
KaZeR is an unknown quantity at this point
This is really amazing.

Maybe you should try xvidcap or something like that, it would probably make better videos?
KaZeR is offline   Reply With Quote
Old 08-08-2007, 03:51 AM   #13
Fusion Brain Creator
 
2k1Toaster's Avatar
 
Join Date: Mar 2006
Location: Colorado, but Canadian!
Posts: 8,862
2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future2k1Toaster has a brilliant future
This is definately cool! Good job so far!
2k1Toaster is offline   Reply With Quote
Old 08-08-2007, 04:03 AM   #14
Raw Wave
 
shotgunefx's Avatar
 
Join Date: Apr 2005
Location: Boston, MA
Posts: 1,800
shotgunefx is on a distinguished road
Interesting stuff, stop getting me thinking

As far as tracking the eyes, what about 3 (or more) IR leds around the screen, then a visible light filter over the camera, I would think you would be able to see them reflected in the iris if you were looking that way. Maybe you could use the change in the "cluster" position and size to determine eye position?
shotgunefx is offline   Reply With Quote
Old 08-08-2007, 04:05 AM   #15
Maximum Bitrate
 
Join Date: Aug 2004
Location: at home
Posts: 588
kraft is on a distinguished road
Quote: Originally Posted by KaZeR View Post
This is really amazing.

Maybe you should try xvidcap or something like that, it would probably make better videos?

You are absolutely right, what i'm doing with my camera is to show it is real and not some vapor thing made with some video editor.
kraft is offline   Reply With Quote
Sponsored links
Advertisement
 
Advertisement
Reply

Bookmarks

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Speech recognition ZbyszekPluta General MP3Car Discussion 0 01-11-2007 05:24 PM
SpeakEasy Voice Recognition - Beta 1 ruairi Road Runner 31 08-23-2006 12:12 AM
Voice recognition problems. WinXP/Navivoice. mikebaz Software & Software Development 5 08-07-2005 04:44 PM
Any Gauge Face Creation Ideas? xhazn Fabrication 14 05-18-2005 09:09 AM
Just had a great idea for using a detachable cd players face Gemini General Hardware Discussion 8 05-13-2004 01:15 PM



All times are GMT -5. The time now is 11:22 PM.


Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.3.2
Copyright © 1999 - 2008 Mp3Car.com Inc.Ad Management by RedTyger
Message Board Statistics