List

好奇目前先進駕駛輔助系統(Advanced Driver Assistance Systems;ADAS)領域供應商在深度學習的發展狀況為何。剛好看了一篇關於 Mobileye的報導,又把NVIDIA BB8的技術文章拿來翻了一下,有趣的比對。

 

Mobileye目前使用的solution是deep learning based 嗎?好奇。

NVIDIA BB8倒是展示了一件事,是不是可以利用end-to-end的作法,讓學習的網路由目前路面影像的輸入直接決定方向盤該轉幾度?這跟之前得先偵測出路面、分割線、道路邊緣的作法大大不同。這也暗示,對於其他深度學習的應用,也可以直接最佳化最後的標的,中間的特徵值或是決策,直接交給網路學習決定。

We do not work on ADAS (advanced driver assistance systems) solutions. However, we are curious about how they develop the solutions in the “deep learning” paradigm. As the outsiders, it’s hard to evaluate their competitiveness. However, it’s fun to evaluate the “relativeness”(but in an imprecise manner) from their PRs.

BTW, are Mobileye’s current deployed solutions deep-learning-based? Anyone knows?

Just saw the following news from MIT Technology review. It looks like the very similar goal from NVIDIA’s BB8, which takes raw camera pixels to determine steering commands during lane (or road) following.

Mobileye is to test their neural network system (augmented with reinforcement learning) in the second half of 2017, which might be 3-4 quarters late from NVIDIA’s demo, more a proof of concept (see the following). Definitely, there will be more advanced features such as lane merging in heavy traffics, risk management triggered by other drivers, etc. It’s hard to judge from the PRs.

NVIDIA’s BB8 is interesting and worthy to glance. It’s more like a proof or concept demo showing that we can skip the intermedia representations — recognizing the objects (e.g., lane markings, outline of the road, objects, trees,) on the roads — that many prior work relied on for steering a vehicle but focus on learning the proper turning radius to keep the car properly in lane. To train the end-to-end system (from images to steering radius, BB8 only uses a common convolutional neural networks and 72 hours of training videos (of three cameras, left, center, and right). No reinforcement learning is adopted yet as shown in their arXiv paper. There are definitely more fancy technical components they can adopt. I bet they are doing as well.

Note that BB8 only describes how to follow on lane and road. It does not evaluate on lane changes, turns from one road to another, or other risk handling tasks. The data augmentation process for teaching the networks to recover from a poor position or orientation by adding the (synthesized) shifts and rotations from the images is interesting. I bet the similar idea can benefit data augmentation for learning from sequences of data.

For BB8, I am referring to NVIDIA’s research work, Bojarski et al., “End to End Learning for Self-Driving Cars,” in arXiv, submitted on April 25, 2016.

 

  Posts

1 2 3 6
June 7th, 2018

First Place (#1) in Disguised Face Recognition in CVPR 2018

January 6th, 2019

意想不到的科技部「AI投資潛力獎第一名」

December 16th, 2018

Keep Recruiting for Machine Learning Research Partners for Numerous Visual Sensors

December 16th, 2018

FutureTech Demo and Breakthrough Award (未來科技突破獎)

October 12th, 2018

結合虛與實的試鞋生成網路 (Virtual Try-On Shoe with Generative Neural Networks)

我們都有這樣的困擾,在電子購物的時候,看到一雙好看的鞋子,想買。但是卻又拿不定主意自己穿起來好看嗎?或是搭配某件褲子適合嗎?怎麼讓網路虛擬商城的鞋子,可以有效試在自己的腳上呢? 這個工作的挑戰在於如何使用單張鞋子商品的照片,很自然的合成在使用者的腳上,而且腳可能會有各種姿勢、角度。如何客服這個問題? 很高興大學部專題生(EE) 周晁德 完成了這個 PIVTONS 的虛擬鞋子試穿生成網路,試著解決這個困擾大家很久的問題。 這個有趣的工作也將於十二月初,在澳洲珀斯舉辦 Asian Conference on Computer Vision (ACCV) 2018 以大會演說 (Oral) 的方式跟大家分享這個工作。接下來全新的測試資料集將會公開讓大家使用,如果可以的話,我們也將試試看將整個試穿生成系統上線,讓大家體驗虛擬試鞋的樂趣 — 可以多試穿,多省錢。 我們鼓勵high-risk的研究工作。令人慶幸的是,這工作的發想、資料收集都是專題生獨立完成。當然在過程當中遇到很多GAN生成的問題,網路設計、訓練的問題,幾乎放棄了,還好團隊成員一起想辦法解決,關關難過,關關過(甚至免費擔任model),讓這個兼具技術深度以及商業價值的系統,可以順利完成。 我們也一直努力,讓智能生成(或是辨識)系統,賦予更有意義的應用 […]

September 13th, 2018

Finalist (Top 3) in 2018 IEEE Signal Processing Society Video and Image Processing (VIP) Cup

July 29th, 2018

信手拈來的3D模型搜尋 (Cross-View and Cross-Domain 3D Model Search)

July 27th, 2018

低解析人臉辨識跟解析度放大 (Very Low-Resolution Face Hallucination and Recognition)

June 18th, 2018

Winning Third Place in CVPR 2018 Video Recognition Challenge — Moments in Time

June 13th, 2018

[Video Report] National Investment for the GPU Supercomputer?

June 9th, 2018

Amazing Crowd Size and Positive Feedbacks in the Deep Learning Lecture for GTC 2018 Taipei