Image Preprocessing-based Generalization and Transfer of Learning for Grasping in Cluttered Environments
- PDF / 1,566,124 Bytes
- 9 Pages / 594.77 x 793.026 pts Page_size
- 36 Downloads / 208 Views
ISSN:1598-6446 eISSN:2005-4092 http://www.springer.com/12555
Image Preprocessing-based Generalization and Transfer of Learning for Grasping in Cluttered Environments Kuk-Hyun Ahn and Jae-Bok Song* Abstract: In a cluttered environment in which objects are lying very closely to each other, the arranging motion is required before the robot attempts to grasp the target object. Thus, a robot must determine which motion to perform based on a given situation. This study presents an approach to learning a decision-making ability for the robot to grasp the target object after rearranging the surrounding objects obstructing the target object. The learning is performed in the virtual environment, and the image, which is an input of the deep Q-network, is preprocessed to directly apply the results of the learning to the real environment. That is, the difference between the two environments is minimized by making the states obtained from the virtual and real environments similar to each other. In addition, image preprocessing can be used to generalize the results of learning so that the robot can determine the appropriate actions to take when objects that were not used for learning are given. A hierarchical structure, which consists of high-level and low-level motion selectors, is used for the learning: the former determines the grasping or pushing actions while the latter determines how to perform such selected actions. The results of various experiments show that the proposed scheme is effective in grasping the target object in a cluttered environment without the need for any additional learning in the real world. Keywords: AI-based application, grasping, reinforcement learning, sim-to-real transfer.
1.
INTRODUCTION
Substantial research has been conducted on grasping in consideration of the fact that grasping is the most basic task that a robot needs to perform. However, in most cases, the focus is on grasping itself, such as the cases estimating the grasp pose without considering the surrounding environment [1–3]. In studies on grasping in cluttered environments, robots usually learn to grasp any object that is expected to be most likely to be grasped without a specified target object [4–6]. This approach has certain limitations if a robot performs a task for which a target object is specified, such as fetching an object specified by a user in a home environment. Some studies have also investigated a method of performing the pushing motion to arrange the adjacent objects for grasping in a cluttered environment. In [7], the scattering motion was carried out when the pixels of the dilated image of the target object overlapped those of the surrounding objects. However, as the scattering motion was performed despite the fact that the adjacent objects did not interfere with grasping, it is not an appropriate method for decision-making. In [8], in order to grasp a target object in a cluttered environment, a task plan was scheduled for a sequence of several given mo-
tions, and the goal was accomplished by conducting these mot
Data Loading...