Top-Down Image Frame Updates #814

mattdeitke · 2021-07-08T01:50:41Z

Now allows third party cameras to be used in place of ToggleMapView with

from ai2thor.controller import Controller
event = controller.step(action="GetMapViewCameraProperties")
controller.step(
    action="AddThirdPartyCamera",
    **event.metadata["actionReturn"]
)

I don't think these should be merged into a single action, because, eventually, Third Party Cameras should be deprecated in favor of having the camera actually be an agent.

Resolves #812.
Fixes #445.
Resolves #599.
Deprecates ToggleMapView.

lgtm-com · 2021-07-08T02:10:18Z

This pull request introduces 2 alerts when merging fd272ce into e8bf86e - view on LGTM.com

new alerts:

1 for Useless assignment to local variable
1 for Dereferenced variable may be null

mattdeitke · 2021-07-08T03:14:48Z

Here is a notebook sharing results: https://colab.research.google.com/drive/1GSIF78B62hNskyr-SYtUMgMdsZ7tJpJQ?usp=sharing

winthos

The new action looks good to go.

I think the only thing is, if we are going to completely deprecate ToggleMapView, we need to be explicit in the documentation that if you do want this new 3rd party, top down camera to follow the agent as the old ToggleMapView camera did, they will need to update the spawned 3rd party camera since it won't automatically follow the agent position and orientation.

I can think of use cases where you would want that old behavior as an option, so perhaps deprecating it entirely is a bit too hasty?

mattdeitke · 2021-07-08T19:50:14Z

I can think of use cases where you would want that old behavior as an option

Can you provide some? I can't think of any reason one would want to move the top-down camera and the agent at the same time, especially when it means the images from the top-down camera will no longer capture the entire scene.

mattdeitke · 2021-07-08T22:32:05Z

Now ready to merge.
See https://colab.research.google.com/drive/1GSIF78B62hNskyr-SYtUMgMdsZ7tJpJQ?usp=sharing for all results.

Lucaweihs

I'm happy with this change, LGTM.

mattdeitke · 2021-07-09T00:11:10Z

Pushed a fix for FloorPlan309, where the camera was actually being placed below the scene. This was because the Ceiling in this room (and in none of the others) actually extends below the floor.

xubo92 · 2021-07-27T20:24:02Z

Hi @mattdeitke

Thanks for this new feature!

Since I was using ai2thor==3.1.0 (on Windows), how can I merge this new feature and make it work in ai2thor-3.1.0?

I tried to git clone ai2thor==3.1.0 and manually change the code files needed and pip install -e ., but that does not work and report error "ValueError: Invalid action: GetMapViewCameraProperties".

Thanks!

xubo92 · 2021-07-27T20:43:50Z

Just a follow-up, since I'm on Windows, does that mean I need to rebuild everything from ai2thor/unity folder in Unity to make this new feature work? The new feature is related to the change of the controller, so I guess the answer is yes?

Thanks!

mattdeitke · 2021-07-27T21:09:19Z

Hey @xubo92,

Yes, if you rebuild locally from the latest commit on the main branch, it will all work, even with version 3.1.0 of the Python API.

xubo92 · 2021-07-28T22:54:07Z

Hi @mattdeitke

I tried this new feature and found out it only works for single agent. When I set agentCount more than 1, then the third_party_camera_frames is empty. Is this normal? Can it by any chance support multi-agent case?

Thanks!

xubo92 · 2021-07-28T23:04:51Z

And a follow-up question for this feature: how can I enable this top-down view all the time for real-time visualization while running controller for many steps?

I tried this code https://colab.research.google.com/drive/1GSIF78B62hNskyr-SYtUMgMdsZ7tJpJQ?usp=sharing#scrollTo=CvZ2PkYp95U2 , it works good when I use plt to show one image. But when I only run my controller and want to visualize in a top-down view constantly, it does not work and still using first-person view.

Thanks!

mattdeitke · 2021-07-28T23:27:16Z

I tried this new feature and found out it only works for single agent. When I set agentCount more than 1, then the third_party_camera_frames is empty. Is this normal? Can it by any chance support multi-agent case?

Here is a demo of it working in the multi-agent case: https://colab.research.google.com/drive/1C_-9zqAqhSUcZvrxVlwlTMZQzWnEzY9k?usp=sharing.

One needs to also pass in agentId and then look at the event that corresponds to that id.

how can I enable this top-down view all the time for real-time visualization while running controller for many steps?

How are you visualizing the frames?

For the single agent case, check out plot_frames in ai2thor-colab. It doesn't yet work with the MultiAgentEvent, but I can probably extend it to if that'd be useful.

xubo92 · 2021-07-29T04:07:37Z

Hi @mattdeitke

Thanks a lot for these examples!

Probably I'm not saying clearly. Here is a code snippet I use

// initialize a controller
self.controller = Controller(agentMode=self.agent_types[0], agentCount=2, scene=self.scene, width=1600, height=900, local_executable_path=local_executable_path)

// setup top-down view
event = self.controller.step(action="GetMapViewCameraProperties")
event = self.controller.step(action="AddThirdPartyCamera", agentId=0, **event.metadata["actionReturn"])

// after setting top-down view, for each move, I expect a unity windows pop out showing what happened in the scene (multiple agents moving), in a top-down view
for i in range(100):
act = np.random.choice(ACTS)
self.controller.step(action=act, agentId=0)
self.controller.step(action=act, agentId=1)

Basically, I'm running a RL algorithm using ai2thor with multiple agents moving around. I expect to visualize the simulated scenes continuously, in a top-down view, so that I can have a sense of the policy training process by keeping an eye on the movement of agents. I expect the top-down view can give me a view of two (or) more agents in a room (not like the first-person view that I have to choose the agentId and get its local view.)

Not sure how to achieve this.

Thanks!

xubo92 · 2021-07-29T04:30:07Z

I remember when use the old command "ToggleMapView", it works like a switch, and one only needs to call it somewhere once (initialization) and after that the default view is switched to top-down view. I thought the new feature works the same way, especially when one has multiple agents so that top-down view can give a whole scene containing every agent.

Sorry I have a lot of questions here...

mattdeitke · 2021-07-29T04:42:31Z

I thought the new feature works the same way

I see. You're looking at the Unity window instead of the image frames.

Basically, the old ToggleMapView took the agents camera, and just move it to the ceiling. But, this had many problems (described here: #599) and breaks nearly all interaction and a lot of the metadata.

Instead, what is now happening, is that we just add a new camera to the scene (a third party camera) and it is positioned such that it captures the scene from a top-down view. However, the agent's camera will still show up in the Unity window.

I don't think it's easy to change which camera is displayed in the Unity window.

What I would do is take videos of the top-down sequences, and save them locally. So, you're code may become:

# initialize a controller
self.controller = Controller(agentMode=self.agent_types[0], agentCount=2, scene=self.scene, width=1600, height=900, local_executable_path=local_executable_path)

# setup top-down view
event = self.controller.step(action="GetMapViewCameraProperties")
event = self.controller.step(action="AddThirdPartyCamera", agentId=0, **event.metadata["actionReturn"])

# after setting top-down view, for each move, I expect a unity windows pop out showing what happened in the scene (multiple agents moving), in a top-down view
map_frames = []
for i in range(100):
    act = np.random.choice(ACTS)
    e1 = self.controller.step(action=act, agentId=0)
    e2 = self.controller.step(action=act, agentId=1)
    map_frames.append(e1.events[0].third_party_camera_frames[0])
    map_frames.append(e2.events[0].third_party_camera_frames[0])

# save the video
from moviepy.editor import ImageSequenceClip
clip = ImageSequenceClip(frames, fps=25)
clip.write_videofile("trajectory.mp4")

xubo92 · 2021-07-30T02:54:47Z

Hi @mattdeitke

Thank you so much for clarifying the difference! Now I understand what is happening.

Saving frames and create a video sounds good! I'll try it.

Thanks!

yaofeng11 · 2022-01-12T11:22:35Z

Hi@mattdeitke
I wonder if it possible to achieve that when I control the robot walking in the room. And the same time,I have the second window to show the total scene of the room (for observe the motion of the robot)

mattdeitke · 2022-01-13T07:07:31Z

Hi @yaofeng11,

There isn't currently a way to view the images from 2 separate cameras in separate Unity windows. But, what might be a workaround is you can do is save images or videos from the agent's perspective and from the top-down perspective, and then view them afterwards.

I just put this notebook demo together on that, which you may find useful:
https://colab.research.google.com/drive/1p6A_LDZA853-6boLdo4ha_1Bg6Cy3Jj9?usp=sharing

Let me know if that helps,
Matt :)

yaofeng11 · 2022-01-13T07:25:32Z

Dear Mr Matt:  Thanks for your help. It bothered me for a whole week. Now I know how to solve it, and I will try it. Much appreciate. Yaofeng :)

…

------------------ 原始邮件 ------------------ 发件人: "allenai/ai2thor" ***@***.***>; 发送时间: 2022年1月13日(星期四) 下午3:07 ***@***.***>; ***@***.******@***.***>; 主题: Re: [allenai/ai2thor] Top-Down Image Frame Updates (#814) Hi @yaofeng11, There isn't currently a way to view the images from 2 separate cameras in separate Unity windows. But, what might be a workaround is you can do is save images or videos from the agent's perspective and from the top-down perspective, and then view them afterwards. I just put this notebook demo together on that, which you may find useful: https://colab.research.google.com/drive/1p6A_LDZA853-6boLdo4ha_1Bg6Cy3Jj9?usp=sharing Let me know if that helps, Matt :) — Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android. You are receiving this because you were mentioned.Message ID: ***@***.***>

first go at getting top-down camera attrs

fd272ce

mattdeitke requested review from Lucaweihs, ekolve and winthos July 8, 2021 01:50

mattdeitke added 4 commits July 7, 2021 19:29

use Vector3 instead of quaternion

3d24530

remove ceiling

ab634af

fix lgtm bug

fb6a140

make sure position isn't above ceiling

603e7e4

mattdeitke added 3 commits July 8, 2021 12:14

there may be multiple ceiling objects :0

eab4319

add mvp to debug input

e2d3392

some edge cases on ceiling go name

d579eb6

winthos reviewed Jul 8, 2021

View reviewed changes

mattdeitke added 4 commits July 8, 2021 13:06

fix for downward sloping ceiling

ed617c3

incase ceiling isn't in name

e1f9a6c

change back tmv wrt removing ceiling

cdb167b

use max y value without ceiling obj

1b6148d

winthos approved these changes Jul 8, 2021

View reviewed changes

ekolve approved these changes Jul 8, 2021

View reviewed changes

Lucaweihs approved these changes Jul 8, 2021

View reviewed changes

patch edge case in fp309

07ac69f

mattdeitke merged commit ec298b9 into main Jul 9, 2021

mattdeitke deleted the tmapView branch July 9, 2021 01:09

mattdeitke mentioned this pull request Aug 15, 2022

How to get top-down map without the agent in the scene? #1059

Open

mengmeng6660 mentioned this pull request Mar 18, 2024

How to get a top view of a room in ai2thor #1196

Open

Top-Down Image Frame Updates #814

Top-Down Image Frame Updates #814

Uh oh!

Conversation

mattdeitke commented Jul 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lgtm-com bot commented Jul 8, 2021

Uh oh!

mattdeitke commented Jul 8, 2021

Uh oh!

winthos left a comment

Choose a reason for hiding this comment

Uh oh!

mattdeitke commented Jul 8, 2021

Uh oh!

mattdeitke commented Jul 8, 2021

Uh oh!

Lucaweihs left a comment

Choose a reason for hiding this comment

Uh oh!

mattdeitke commented Jul 9, 2021

Uh oh!

xubo92 commented Jul 27, 2021

Uh oh!

xubo92 commented Jul 27, 2021

Uh oh!

mattdeitke commented Jul 27, 2021

Uh oh!

xubo92 commented Jul 28, 2021

Uh oh!

xubo92 commented Jul 28, 2021

Uh oh!

mattdeitke commented Jul 28, 2021

Uh oh!

xubo92 commented Jul 29, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xubo92 commented Jul 29, 2021

Uh oh!

mattdeitke commented Jul 29, 2021

Uh oh!

xubo92 commented Jul 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yaofeng11 commented Jan 12, 2022

Uh oh!

mattdeitke commented Jan 13, 2022

Uh oh!

yaofeng11 commented Jan 13, 2022 via email

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

mattdeitke commented Jul 8, 2021 •

edited

Loading

xubo92 commented Jul 29, 2021 •

edited

Loading

xubo92 commented Jul 30, 2021 •

edited

Loading