List

For navigating the complex surrounding, we envision a proactive and adaptive drone-based building identification technology — recognizing the key buildings in arbitrary deployed cities. However, navigating in the complex city will be a huge problem even with the state-of-the-art sensing and mapping technologies as flying in the sky is hugely different from driving on the streets.

新的嘗試,有沒有可能無人機出廠之後,可以在任一個都市飛行過程中,動態、主動地辨識出主要的地標或是建築物?

We frame this drone-view building identification as “few-shot learning problem” for the major buildings in the city. The identification process is on the fly and hopes to adapt to different cities whenever deployed.

As the drone navigates the city, the (surrounding) buildings of interest are retrieved from the Web (or maps) provided current drone location. We associate the building proposals (in the drone view) with the candidates (in the Web or maps) by the (learned) visual and sensor similarities. In general, the buildings are with few user-contributed or even street-view photos. However, the view is completely different from the drone view. We designed a cross-view learning neural networks to address the problem. We also investigate the impacts from other sensor information (e.g., GPS, compass, etc.)

To our best knowledge, this is probably the first work to address the problem. There should other promising methods for further improving the newly formed problem. We also make the drone-view building identification dataset: “DroneViewBI” public for the academic usage. Feel free to comment and enjoy the carefully collected dataset. The work will also be presented in the CVPR 2018 workshop.

Dataset: https://jacky82226.github.io/DVBI/

Paper: Chun-Wei Chen, Yin-Hsi Kuo, Tang Lee, Cheng-Han Lee, Winston Hsu. Drone-View Building Identification by Cross-View Visual Learning and Relative Spatial Estimation, CVPR Workshops (CVPRW) 2018.

Note that the demo video shows the recognized buildings (on the fly) as navigating Taipei city. The grey boxes are the candidate location proposals.

  Tag: dataset

3 posts
June 7th, 2018

Drone-View Building Identification: Dataset and the Pilot Work

For navigating the complex surrounding, we envision a proactive and adaptive drone-based building identification technology — recognizing the key buildings […]

February 6th, 2018

Public “NetiLook Dataset” for Netizen-Style Commenting on Fashion Photos

你能區分影像社群網站的留言是來自機器還是真實使用者嗎? 從對話機器人、或是我們之前參與的小冰寫詩,我們知道影像與自動文字描述的重要性。超越平鋪直述的影像文字敘述,過去兩年我們研究「鄉民式」的影像自動留言系統,針對流行服飾的社群網站。主要研究如何產生更生動、更多樣性的影像對話內容,結合影像以及文字的深度學習技術。我們發現如何量測「多樣性」、「新鮮感」都是個未探討的議題。 同樣地,我們樂於將先導性研究所採集的大型資料與全球的研究社群分享,也是目前唯一相關研究的資料集。所採集目前含有超過35萬張照片、一萬一千位使用者、五百萬留言的「NetiLook」資料集目前公開下載。 相關技術也將發表在頂尖會議 WWW 2018 (Cognitive Computing Track). 我們並發現,一般使用者很難區分機器產生或是使用者的留言。 Recently, image captioning has appeared promising, which is expected to […]

February 7th, 2017

A brand new dataset for photo filter recommendation by aesthetic learning (美的鑑賞,人工智慧也可以嗎?)

To make photos more visually appealing, users usually apply filters on their photos. However, due to the growing number of […]