Detect to Focus: Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Detect to Focus : Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning. / Anikina, Anna; Rogov, Oleg Y.; Dylov, Dmitry V.

I: IEEE Access, Bind 11, 2023, s. 85214-85223.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Anikina, A, Rogov, OY & Dylov, DV 2023, 'Detect to Focus: Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning', IEEE Access, bind 11, s. 85214-85223. https://doi.org/10.1109/ACCESS.2023.3303844

APA

Anikina, A., Rogov, O. Y., & Dylov, D. V. (2023). Detect to Focus: Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning. IEEE Access, 11, 85214-85223. https://doi.org/10.1109/ACCESS.2023.3303844

Vancouver

Anikina A, Rogov OY, Dylov DV. Detect to Focus: Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning. IEEE Access. 2023;11:85214-85223. https://doi.org/10.1109/ACCESS.2023.3303844

Author

Anikina, Anna ; Rogov, Oleg Y. ; Dylov, Dmitry V. / Detect to Focus : Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning. I: IEEE Access. 2023 ; Bind 11. s. 85214-85223.

Bibtex

@article{de889ce361f54780aeb122d5e54dc7e6,
title = "Detect to Focus: Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning",
abstract = "State-of-the-art object detection models are frequently trained offline using available datasets, such as ImageNet: large and overly diverse data that are unbalanced and hard to cluster semantically. This kind of training drops the object detection performance should the change in illumination, in the environmental conditions (e.g., rain or dust), or in the lens positioning (out-of-focus blur) occur. We propose a simple way to intelligently control the camera and the lens focusing settings in such scenarios using DASHA, a Decentralized Autofocusing System with Hierarchical Agents. Our agents learn to focus on scenes in challenging environments, significantly enhancing the pattern recognition capacity beyond the popular detection models (YOLO, Faster R-CNN, and Retina are considered). At the same time, the decentralized training allows preserving the equipment from overheating. The algorithm relies on the latent representation of the camera's stream and, thus, it is the first method to allow a completely no-reference imaging, where the system trains itself to auto-focus itself. The paper introduces a novel method for auto-tuning imaging equipment via hierarchical reinforcement learning. The technique involves the use of two interacting agents which independently manage the camera and lens settings, enabling optimal focus across different lighting situations. The unique aspect of this approach is its dependence on the latent feature vector of the real-time image scene for autofocusing, marking it as the first method of its kind to auto-tune a camera without necessitating reference or calibration data.",
keywords = "Artificial intelligence, computer vision, imaging, lenses, multi-agent systems, neural networks, photography, reinforcement learning",
author = "Anna Anikina and Rogov, {Oleg Y.} and Dylov, {Dmitry V.}",
note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",
year = "2023",
doi = "10.1109/ACCESS.2023.3303844",
language = "English",
volume = "11",
pages = "85214--85223",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

RIS

TY - JOUR

T1 - Detect to Focus

T2 - Latent-Space Autofocusing System with Decentralized Hierarchical Multi-Agent Reinforcement Learning

AU - Anikina, Anna

AU - Rogov, Oleg Y.

AU - Dylov, Dmitry V.

N1 - Publisher Copyright: © 2013 IEEE.

PY - 2023

Y1 - 2023

N2 - State-of-the-art object detection models are frequently trained offline using available datasets, such as ImageNet: large and overly diverse data that are unbalanced and hard to cluster semantically. This kind of training drops the object detection performance should the change in illumination, in the environmental conditions (e.g., rain or dust), or in the lens positioning (out-of-focus blur) occur. We propose a simple way to intelligently control the camera and the lens focusing settings in such scenarios using DASHA, a Decentralized Autofocusing System with Hierarchical Agents. Our agents learn to focus on scenes in challenging environments, significantly enhancing the pattern recognition capacity beyond the popular detection models (YOLO, Faster R-CNN, and Retina are considered). At the same time, the decentralized training allows preserving the equipment from overheating. The algorithm relies on the latent representation of the camera's stream and, thus, it is the first method to allow a completely no-reference imaging, where the system trains itself to auto-focus itself. The paper introduces a novel method for auto-tuning imaging equipment via hierarchical reinforcement learning. The technique involves the use of two interacting agents which independently manage the camera and lens settings, enabling optimal focus across different lighting situations. The unique aspect of this approach is its dependence on the latent feature vector of the real-time image scene for autofocusing, marking it as the first method of its kind to auto-tune a camera without necessitating reference or calibration data.

AB - State-of-the-art object detection models are frequently trained offline using available datasets, such as ImageNet: large and overly diverse data that are unbalanced and hard to cluster semantically. This kind of training drops the object detection performance should the change in illumination, in the environmental conditions (e.g., rain or dust), or in the lens positioning (out-of-focus blur) occur. We propose a simple way to intelligently control the camera and the lens focusing settings in such scenarios using DASHA, a Decentralized Autofocusing System with Hierarchical Agents. Our agents learn to focus on scenes in challenging environments, significantly enhancing the pattern recognition capacity beyond the popular detection models (YOLO, Faster R-CNN, and Retina are considered). At the same time, the decentralized training allows preserving the equipment from overheating. The algorithm relies on the latent representation of the camera's stream and, thus, it is the first method to allow a completely no-reference imaging, where the system trains itself to auto-focus itself. The paper introduces a novel method for auto-tuning imaging equipment via hierarchical reinforcement learning. The technique involves the use of two interacting agents which independently manage the camera and lens settings, enabling optimal focus across different lighting situations. The unique aspect of this approach is its dependence on the latent feature vector of the real-time image scene for autofocusing, marking it as the first method of its kind to auto-tune a camera without necessitating reference or calibration data.

KW - Artificial intelligence

KW - computer vision

KW - imaging

KW - lenses

KW - multi-agent systems

KW - neural networks

KW - photography

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85167798045&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2023.3303844

DO - 10.1109/ACCESS.2023.3303844

M3 - Journal article

AN - SCOPUS:85167798045

VL - 11

SP - 85214

EP - 85223

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

ER -

ID: 368343910