Module 2 Quiz: Sensors and Perception for Humanoid Robots
Test your understanding of camera systems, depth sensing, IMU sensors, and sensor fusion techniques for humanoid robots.
Instructions
- Total Questions: 15
- Passing Score: 12/15 correct (80%)
- Time Limit: 45 minutes (recommended)
- Question Types: Multiple choice, short answer, code reading
- Resources: You may refer to lesson notes during the quiz
- Scoring:
- Multiple choice: 1 point each
- Short answer: 1-2 points each (partial credit available)
- Code reading: 2 points each (with rubric)
Need to review? Questions reference specific lessons. If you're unsure, revisit:
- Questions 1-4: Lesson 1 - Camera Systems
- Questions 5-8: Lesson 2 - Depth Sensing
- Questions 9-11: Lesson 3 - IMU & Proprioception
- Questions 12-13: Lesson 4 - Sensor Fusion
- Questions 14-15: Integration across multiple lessons
Questions
Lesson 1: Camera Systems (Questions 1-4)
Question 1 (Multiple Choice) - 1 point
What is the primary difference between monocular, stereo, and RGB-D cameras in terms of depth perception?
A) Monocular cameras can see in 3D, stereo cameras are 2D, RGB-D cameras are color only B) Monocular cameras provide no depth, stereo cameras estimate depth through triangulation, RGB-D cameras provide direct depth measurements C) Monocular cameras are for indoor use, stereo cameras for outdoor, RGB-D cameras for underwater D) Monocular cameras are expensive, stereo cameras are cheap, RGB-D cameras are free
Reveal Answer
Correct Answer: B
Explanation: Monocular cameras provide only 2D images with no depth information. Stereo cameras use two lenses to capture slightly different views, allowing depth estimation through triangulation. RGB-D cameras (like Kinect) provide both color (RGB) and direct depth measurements (D) from structured light or time-of-flight sensors.
Lesson Reference: 01-camera-systems.md (Camera Types for Humanoid Robots)
Question 2 (Multiple Choice) - 1 point
Which ROS2 message type is used for standard camera image data?
A) sensor_msgs/Image B) sensor_msgs/Camera C) sensor_msgs/Video D) sensor_msgs/CameraImage
Reveal Answer
Correct Answer: A
Explanation: The sensor_msgs/Image message type is the standard ROS2 message for camera image data. It contains the raw pixel data along with metadata like encoding, height, width, and step size.
Lesson Reference: 01-camera-systems.md (ROS2 Integration - sensor_msgs/Image)
Question 3 (Short Answer) - 2 points
Explain the trade-offs between placing cameras on a humanoid robot's head, chest, and wrists. For each location, describe one advantage and one disadvantage for humanoid robot tasks.
Reveal Answer & Rubric
Sample Answer:
- Head: Advantage - matches human eye perspective for natural interaction and navigation. Disadvantage - limited view of hands during manipulation tasks.
- Chest: Advantage - good view of ground and obstacles for navigation. Disadvantage - limited view of upper environment and human faces during interaction.
- Wrists: Advantage - provides visual feedback during manipulation tasks. Disadvantage - limited field of view and constantly moving with arm motion.
Grading Rubric (2 points total):
- 2 points: Correctly identifies advantages and disadvantages for all three locations with specific robot tasks
- 1 point: Correctly identifies advantages/disadvantages for 1-2 locations OR has partial understanding
- 0 points: Incorrect or irrelevant responses
Lesson Reference: 01-camera-systems.md (Camera Placement Strategies)
Question 4 (Short Answer) - 1 point
What does the "field of view" (FOV) parameter indicate for a camera, and why is it important for humanoid robot navigation?
Reveal Answer & Rubric
Sample Answer: Field of view indicates the angular extent of the scene that a camera can capture. For humanoid robots, FOV is important because it determines how much of the environment the robot can see at once, affecting obstacle detection, navigation safety, and situational awareness.
Grading Rubric (1 point):
- 1 point: Correctly explains FOV as angular extent AND relates to robot navigation/environment awareness
- 0 points: Incorrect or incomplete explanation
Lesson Reference: 01-camera-systems.md (Camera Parameters - Field of View)
Lesson 2: Depth Sensing (Questions 5-8)
Question 5 (Multiple Choice) - 1 point
How does 2D LiDAR differ from 3D LiDAR in terms of environmental perception for humanoid robots?
A) 2D LiDAR is for indoor use, 3D LiDAR is for outdoor use B) 2D LiDAR measures distance in one plane (typically ground level), 3D LiDAR measures in multiple planes (volume) C) 2D LiDAR is faster, 3D LiDAR is more accurate D) 2D LiDAR is expensive, 3D LiDAR is cheap
Reveal Answer
Correct Answer: B
Explanation: 2D LiDAR typically scans in a single horizontal plane and provides distance measurements in that plane only. 3D LiDAR scans in multiple planes to create a 3D point cloud, providing full volumetric information about the environment including height and depth.
Lesson Reference: 02-depth-sensing.md (LiDAR Technologies - 2D vs 3D)
Question 6 (Multiple Choice) - 1 point
Which ROS2 message type is used for 3D point cloud data from depth sensors?
A) sensor_msgs/PointCloud B) sensor_msgs/Point3D C) sensor_msgs/PointCloud2 D) sensor_msgs/LaserScan3D
Reveal Answer
Correct Answer: C
Explanation: sensor_msgs/PointCloud2 is the standard ROS2 message for 3D point cloud data. It contains structured binary data representing 3D coordinates of points in space, along with optional color and other per-point information.
Lesson Reference: 02-depth-sensing.md (ROS2 Integration - sensor_msgs/PointCloud2)
Question 7 (Short Answer) - 2 points
Compare the advantages and disadvantages of LiDAR vs stereo vision for humanoid robot navigation. Provide one advantage and one disadvantage for each technology.
Reveal Answer & Rubric
Sample Answer:
- LiDAR Advantages: Precise distance measurements, works in darkness, reliable in various lighting conditions
- LiDAR Disadvantages: Cannot detect transparent surfaces, higher cost, sensitive to rain/fog
- Stereo Vision Advantages: Provides color information, lower cost, works in most environments
- Stereo Vision Disadvantages: Sensitive to lighting conditions, less accurate at longer distances, computationally intensive
Grading Rubric (2 points total):
- 2 points: Correctly identifies one advantage and one disadvantage for both technologies
- 1 point: Correctly identifies advantage/disadvantage for one technology OR partial understanding of both
- 0 points: Incorrect or irrelevant responses
Lesson Reference: 02-depth-sensing.md (Depth Technology Comparison)
Question 8 (Short Answer) - 1 point
What is a "point cloud" in the context of depth sensing, and how does it differ from a regular 2D image?
Reveal Answer & Rubric
Sample Answer: A point cloud is a collection of 3D points in space, where each point has X, Y, and Z coordinates representing the 3D structure of objects in the environment. Unlike a 2D image which only has X, Y coordinates with color/intensity values, a point cloud provides full 3D spatial information about the scene.
Grading Rubric (1 point):
- 1 point: Correctly explains point cloud as 3D points with X, Y, Z coordinates AND differentiates from 2D images
- 0 points: Incorrect or incomplete explanation
Lesson Reference: 02-depth-sensing.md (Point Cloud Fundamentals)
Lesson 3: IMU and Proprioception (Questions 9-11)
Question 9 (Multiple Choice) - 1 point
Which three sensors are typically combined in an IMU for humanoid robot balance?
A) Camera, gyroscope, magnetometer B) Accelerometer, gyroscope, magnetometer C) LiDAR, accelerometer, gyroscope D) GPS, magnetometer, camera
Reveal Answer
Correct Answer: B
Explanation: An IMU (Inertial Measurement Unit) combines three sensor types: accelerometer (measures linear acceleration and gravity), gyroscope (measures angular velocity/rotation rates), and magnetometer (measures magnetic field for heading). Together they provide comprehensive motion and orientation sensing.
Lesson Reference: 03-imu-proprioception.md (What Is an IMU?)
Question 10 (Multiple Choice) - 1 point
What does a gyroscope in an IMU measure for humanoid robot balance?
A) Linear acceleration and gravity B) Angular velocity (rate of rotation) C) Magnetic field direction D) Distance to obstacles
Reveal Answer
Correct Answer: B
Explanation: A gyroscope measures angular velocity - the rate at which the robot is rotating around any axis. This is crucial for balance control as it allows the robot to detect when it's starting to tip or rotate unexpectedly.
Lesson Reference: 03-imu-proprioception.md (Gyroscope Function)
Question 11 (Short Answer) - 2 points
Explain how IMU data can be used to detect if a humanoid robot is falling. Describe the sensor readings that would indicate a fall is occurring.
Reveal Answer & Rubric
Sample Answer: A humanoid robot falling would show characteristic IMU readings: (1) Accelerometer readings would show near-zero linear acceleration during freefall (as the robot and sensors accelerate together at gravity), followed by high acceleration during impact. (2) Gyroscope readings might show rapid, uncontrolled rotation during the fall. (3) The combination of these readings can be processed by fall detection algorithms that look for acceleration magnitude thresholds and rotation patterns that indicate loss of balance.
Grading Rubric (2 points total):
- 2 points: Correctly explains freefall acceleration near zero AND impact acceleration AND mentions gyroscope rotation patterns
- 1 point: Correctly explains one or two aspects of fall detection
- 0 points: Incorrect or irrelevant explanation
Lesson Reference: 03-imu-proprioception.md (Fall Detection with IMU)
Lesson 4: Sensor Fusion (Questions 12-13)
Question 12 (Multiple Choice) - 1 point
Why is sensor fusion essential for humanoid robot perception?
A) It reduces the cost of individual sensors B) It combines multiple sensors to overcome individual limitations and create more robust perception C) It increases the size of the robot D) It eliminates the need for any single sensor
Reveal Answer
Correct Answer: B
Explanation: Sensor fusion combines data from multiple sensors to create more accurate, reliable, and complete information than any single sensor could provide. It allows robots to overcome limitations (e.g., cameras fail in darkness, LiDAR struggles with transparent surfaces) by intelligently integrating complementary strengths.
Lesson Reference: 04-sensor-fusion.md (What Is Sensor Fusion?)
Question 13 (Multiple Choice) - 1 point
What does a complementary filter do in sensor fusion?
A) It eliminates all sensor data B) It combines gyroscope and accelerometer data with configurable time constants to provide drift-corrected orientation C) It only uses camera data D) It doubles the sensor data
Reveal Answer
Correct Answer: B
Explanation: A complementary filter combines gyroscope data (good for short-term rotation tracking) with accelerometer data (good for long-term orientation reference) using configurable time constants. This provides drift-corrected orientation estimation by leveraging the strengths of each sensor type.
Lesson Reference: 04-sensor-fusion.md (Complementary Filtering)
Integration Questions (Questions 14-15)
Question 14 (Short Answer) - 2 points
Design a minimal multi-sensor perception system for a humanoid robot that needs to navigate indoors, avoid obstacles, and maintain balance. Identify which sensors from Module 2 would be essential and explain why each is necessary.
Reveal Answer & Rubric
Sample Answer: Essential sensors would include: (1) Stereo cameras for navigation scene understanding and human interaction, (2) 3D LiDAR for precise obstacle detection and mapping (works in darkness), (3) IMU for balance control and orientation estimation. Cameras provide visual context, LiDAR provides reliable distance measurements for navigation, and IMU provides internal motion sensing for balance. The fusion of all three creates robust perception that works in various lighting conditions and provides both external awareness and internal state knowledge.
Grading Rubric (2 points total):
- 2 points: Identifies at least 2-3 appropriate sensors AND explains specific reasons for each in the context of the task
- 1 point: Identifies sensors but with limited explanation OR identifies only 1-2 sensors with good explanation
- 0 points: Incorrect sensor choices or irrelevant explanations
Lesson Reference: 04-sensor-fusion.md (Multi-Sensor Integration)
Question 15 (Short Answer) - 2 points
Explain how a humanoid robot would use sensor fusion to navigate safely when transitioning from a well-lit indoor area to a dark hallway. What sensors would become more or less important in each environment?
Reveal Answer & Rubric
Sample Answer: In the well-lit area, cameras would provide excellent navigation information for obstacle detection and path planning. As the robot enters darkness, camera effectiveness decreases significantly. The fusion system would automatically reduce weighting of camera data while increasing reliance on LiDAR and depth sensors, which work independently of lighting. The IMU would continue providing balance and orientation information throughout the transition. The robot's navigation system would adaptively combine available sensor data based on confidence levels, maintaining safe navigation throughout the lighting transition.
Grading Rubric (2 points total):
- 2 points: Correctly explains sensor adaptation during lighting transition AND identifies which sensors become more/less important
- 1 point: Explains some aspects of sensor adaptation OR identifies some sensor changes
- 0 points: Incorrect or irrelevant explanation
Lesson Reference: 04-sensor-fusion.md (Adaptive Sensor Weighting)