Hi there!
When I run Mask R-CNN to generate the preprocessed result, the instance labels are not consistent from frame to frame. For example, in the first image the background is 0, object A is 1, and object B is 2. In the second image, however, object A is 2 and object B is 1 (the background is still 0).
How can I fix this so that the same instance keeps the same label across frames? Does the VDO-SLAM system require the semantic instance labels to be aligned across all frames?
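To make the question concrete, here is a rough sketch of the kind of relabeling I have in mind: match each instance mask in the current frame to the best-overlapping mask in the previous frame by IoU, and reuse that ID. The function name `relabel_to_previous`, the `iou_threshold` value, and the greedy matching are my own assumptions for illustration, not something taken from VDO-SLAM or Mask R-CNN, and I do not know whether VDO-SLAM needs this step at all.

```python
import numpy as np


def relabel_to_previous(prev_labels, curr_labels, iou_threshold=0.5, next_id=None):
    """Remap instance IDs in curr_labels to agree with prev_labels (0 = background).

    Hypothetical helper: greedy one-to-one matching on mask IoU.
    Returns the relabeled map and the next unused instance ID.
    """
    prev_ids = [int(i) for i in np.unique(prev_labels) if i != 0]
    curr_ids = [int(i) for i in np.unique(curr_labels) if i != 0]
    if next_id is None:
        next_id = max(prev_ids, default=0) + 1

    # Collect every (IoU, current ID, previous ID) pair above the threshold.
    candidates = []
    for c in curr_ids:
        c_mask = curr_labels == c
        for p in prev_ids:
            p_mask = prev_labels == p
            union = np.logical_or(c_mask, p_mask).sum()
            inter = np.logical_and(c_mask, p_mask).sum()
            iou = inter / union if union else 0.0
            if iou >= iou_threshold:
                candidates.append((iou, c, p))

    # Greedy assignment: best overlaps first, each ID used at most once.
    candidates.sort(key=lambda t: t[0], reverse=True)
    mapping, used_prev = {}, set()
    for _, c, p in candidates:
        if c not in mapping and p not in used_prev:
            mapping[c] = p
            used_prev.add(p)

    # Unmatched instances are treated as new objects and get fresh IDs.
    out = np.zeros_like(curr_labels)
    for c in curr_ids:
        if c not in mapping:
            mapping[c] = next_id
            next_id += 1
        out[curr_labels == c] = mapping[c]
    return out, next_id


# The situation described above: A and B swap labels in the second frame.
frame1 = np.array([[1, 1, 0, 2, 2]])
frame2 = np.array([[2, 2, 0, 1, 1]])
aligned, next_free_id = relabel_to_previous(frame1, frame2)
```

This only works if objects move little between consecutive frames. With larger motion, the previous masks would probably have to be warped (for example with optical flow) before computing the overlap.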
Thank you very much.