SahiKid: An Efficient SAHI-based Framework for Small Kidney Stone Detection in CT Scans

  • Phu An Thai Can Tho University
  • Thanh Ma Can Tho University
  • Tien Dao Luu Can Tho University
  • Huu Hoa Nguyen Can Tho University
  • Phat Dat Trinh Can Tho University
  • Van Kha Phan Can Tho University
  • Chi Thien Nguyen VNPT
Keywords: Kidney stones, Small object detection, CT images, SAHI, Medical image analysis, Deep learning

Abstract

Kidney stones are a prevalent urological issue, and early detection—particularly of small stones on CT scans—is critical for effective treatment and the prevention of complications. However, accurately identifying these tiny structures remains challenging due to their size and the complexity of surrounding anatomy. This study proposes an efficient detection framework that integrates a lightweight object detection (OD) model (i.e., Yolov11n) with Slicing Aided Hyper Inference (SAHI) techniques. In addition, we introduce a novel algorithm, termed B-SAHI, to improve computation time. Our proposals are evaluated on a public CT dataset under various settings, including full-image inference (FI), Slicing Aided Fine-tuning (SF), patch overlapping (PO), and kidney region localization (KD). The best-performing implementation—OD with SF, SAHI, PO, and KD—achieved a mean Average Precision (mAP50) of 78.66%, vastly outperforming the 0.05% baseline. In terms of efficiency, B-SAHI reduced the computation time from 0.1553 to 0.0329 seconds, enabling real-time performance. These results underscore the effectiveness of slice-based inference and anatomical focus in enhancing small object detection in medical imaging.

References

1. Akyon, F.C., Altinuc, S.O., Temizel, A.: Slicing aided hyper inference and finetuning for small object detection. In: 2022 IEEE international conference on image
processing (ICIP). pp. 966–970. IEEE (2022)
2. Anwar, S.M., Majid, M., Qayyum, A., Awais, M., Alnowami, M., Khan, M.K.:
Medical image analysis using convolutional neural networks: a review. Journal of
medical systems 42, 1–13 (2018)
3. Bhojani, N., Paonessa, J.E., El Tayeb, M.M., Williams Jr, J.C., Hameed, T.A.,
Lingeman, J.E.: Sensitivity of noncontrast computed tomography for small renal
calculi with endoscopy as the gold standard. Urology 117, 36–40 (2018)
4. Brisbane, W., Bailey, M.R., Sorensen, M.D.: An overview of kidney stone imaging
techniques. Nature Reviews Urology 13(11), 654–662 (2016)
5. Chaney, E.L., Pizer, S.M.: Defining anatomical structures from medical images. In:
Seminars in Radiation Oncology. vol. 2, pp. 215–225. Elsevier (1992)
6. Cheng, G., Yuan, X., Yao, X., Yan, K., Zeng, Q., Xie, X., Han, J.: Towards large-scale small object detection: Survey and benchmarks. IEEE Transactions on Pattern Analysis and Machine Intelligence 45(11), 13467–13488 (2023).
https://doi.org/10.1109/TPAMI.2023.3290594
7. Ding, J., Xue, N., Xia, G.S., Bai, X., Yang, W., Yang, M., Belongie, S., Luo, J.,
Datcu, M., Pelillo, M., Zhang, L.: Object detection in aerial images: A large-scale
benchmark and challenges. IEEE Transactions on Pattern Analysis and Machine
Intelligence pp. 1–1 (2021). https://doi.org/10.1109/TPAMI.2021.3117983
8. Giger, M.L., Suzuki, K.: Computer-aided diagnosis. In: Biomedical information
technology, pp. 359–XXII. Elsevier (2008)
9. Girshick, R.: Fast r-cnn. In: Proceedings of the IEEE international conference on
computer vision. pp. 1440–1448 (2015)
10. Kwateng, W.N.: Kidney stones (nephrolithiasis) statistics: Global and us overview,
types, trends, and complications
11. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: Ssd:
Single shot multibox detector. In: Computer Vision–ECCV 2016: 14th European
Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part
I 14. pp. 21–37. Springer (2016)
12. Liu, Y., Sun, P., Wergeles, N., Shang, Y.: A survey and performance evaluation of
deep learning methods for small object detection. Expert Systems with Applications 172, 114602 (2021)
13. Mirzaei, B., Nezamabadi-Pour, H., Raoof, A., Derakhshani, R.: Small object detection and tracking: a comprehensive review. Sensors 23(15), 6887 (2023)
14. Neubeck, A., Van Gool, L.: Efficient non-maximum suppression. In: 18th international conference on pattern recognition (ICPR’06). vol. 3, pp. 850–855. IEEE
(2006)
15. Ramesh, B., Srinivas, G., Reddy, P.R.P., Rasool, M.H., Rawat, D., Sundaray, M.:
Feasible prediction of multiple diseases using machine learning. In: E3S Web of
Conferences. vol. 430, p. 01051. EDP Sciences (2023)
16. Rasheed, A.F., Zarkoosh, M.: Yolov11 optimization for efficient resource utilization.
arXiv preprint arXiv:2412.14790 (2024)
17. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: Unified,
real-time object detection. In: Proceedings of the IEEE conference on computer
vision and pattern recognition. pp. 779–788 (2016)
18. Wei, W., Cheng, Y., He, J., Zhu, X.: A review of small object detection based on
deep learning. Neural Computing and Applications 36(12), 6283–6303 (2024)
19. Zhu, P., Wen, L., Du, D., Bian, X., Fan, H., Hu, Q., Ling, H.: Detection and tracking meet drones challenge. IEEE Transactions on Pattern Analysis and Machine
Intelligence pp. 1–1 (2021). https://doi.org/10.1109/TPAMI.2021.3119563
Published
2025-10-01