Low-Barrier Object Detection for Mobile Applications

  • Anh Hoang +84967077784 https://orcid.org/0000-0003-2754-3710
  • Khoi Nguyen The, Department of Information Technology, FPT University HCMC
  • Tuong Ho Vinh, Department of Information Technology, TDT University
Keywords: Distillation, object detection, Android application

Abstract

This paper investigates innovative applications of image processing and object recognition methods aimed at simplifying the creation and deployment of object detection models. By doing so, we seek to expand access to advanced computer vision technologies for small and medium-sized businesses. Our research leverages a combination of modern technologies including Flask for web development, Firebase for database management, and Kotlin and Jetpack Compose for mobile application development. We integrate these with automatic training methods provided by the Autodistill library, utilizing models such as Detic and YOLOv8.

The results demonstrate that this technological combination significantly enhances the performance of our object detection models, contributing to AI solutions in the digital intelligence era. A notable advancement is Autodistill’s capability to bypass the traditional dataset creation step by automatically generating labeled datasets from unlabeled input data. This feature markedly improves the efficiency and effectiveness of both data preparation and model training processes. Overall, our findings underscore the potential of these integrated technologies to democratize access to sophisticated computer vision capabilities for smaller enterprises, fostering greater innovation and competitiveness in the marketplace.
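
The automatic labeling idea described above can be illustrated with a minimal sketch. It does not use the Autodistill API. It assumes a hypothetical `stub_detector` standing in for a large base model such as Detic, and it assumes the target detector trains on YOLO-style text labels (class id followed by normalized center x, center y, width, height). The names `Box`, `to_yolo_line`, and `auto_label` are illustrative only.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Box:
    """A detection in pixel coordinates, as a base model might return it."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def to_yolo_line(box: Box, class_ids: Dict[str, int], img_w: int, img_h: int) -> str:
    """Convert a pixel-space box to one YOLO label line (normalized to 0..1)."""
    cx = (box.x_min + box.x_max) / 2 / img_w
    cy = (box.y_min + box.y_max) / 2 / img_h
    w = (box.x_max - box.x_min) / img_w
    h = (box.y_max - box.y_min) / img_h
    return f"{class_ids[box.label]} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


def auto_label(
    images: Dict[str, Tuple[int, int]],
    detector: Callable[[str], List[Box]],
    class_ids: Dict[str, int],
) -> Dict[str, List[str]]:
    """Label every unlabeled image with the detector's predictions.

    images maps an image name to its (width, height) in pixels.
    Detections whose label is not in class_ids are skipped.
    """
    labels: Dict[str, List[str]] = {}
    for name, (img_w, img_h) in images.items():
        boxes = detector(name)
        labels[name] = [
            to_yolo_line(b, class_ids, img_w, img_h)
            for b in boxes
            if b.label in class_ids
        ]
    return labels


def stub_detector(image_name: str) -> List[Box]:
    """Hypothetical stand-in for a base model; always returns one fixed box."""
    return [Box("bottle", 100, 50, 300, 450)]


if __name__ == "__main__":
    dataset = auto_label({"shelf.jpg": (640, 480)}, stub_detector, {"bottle": 0})
    for image_name, lines in dataset.items():
        print(image_name, lines)
```

In a real pipeline the stub would be replaced by the base model's predictions, and the resulting label files would be used to train the smaller target model (YOLOv8 in this work) for deployment on the mobile device.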

Author Biographies

Anh Hoang, +84967077784

Anh HOANG received the B.S. degree in Telecommunication Engineering from the Department of Electrical, Electronic, and Information Engineering, Hanoi University of Transport and Communication, in 2007, and the M.S. degree in Computer Science (major in Wireless Networks Security) from the National Taiwan University of Science and Technology (NTUST), Taiwan, in 2010. He completed the Ph.D. program at the Graduate School of Advanced Science and Technology (major in Knowledge Science), Japan Advanced Institute of Science and Technology (JAIST), Japan, in September 2021. His research interests are related to AI/Machine Learning, Data Science/Data Mining/Data Analytics, Cybersecurity, and Business Intelligence/Business Analytics.

Khoi Nguyen The, Department of Information Technology, FPT University HCMC

Khoi Nguyen The received a bachelor’s degree in artificial intelligence from FPT University. His current research interests include computer vision, object detection, deep learning, and self-supervised learning approaches.

Tuong Ho Vinh, Department of Information Technology, TDT University

Tuong Ho Vinh is a final-year Software Engineering student at TDT University with a strong interest in researching and developing computer vision technology. Tuong Ho Vinh has experience in projects related to image processing and deep learning and aspires to contribute to the advancement of this field through scientific research, with the goal of applying acquired knowledge and skills to solve practical problems and drive technological development.

Published
2024-11-25