KWIVER Processes in ACT Detector

ACT [KWFS17]

class act_detector.ACTDetector(conf)[source]

Forward pass for ACT

  • Input Ports:
    • rgb_image RGB image (Required)
    • flow_image Flow image (Required)
    • timestamp Timestamp associated with the images (Required)
    • file_name Name of the input source (Required)
  • Output Ports:
    • object_track_set Tracks produced by forward pass of RC3D
  • Configuration:
    • exp Experiment configuration used by ACT (Eg. exp.yml)
    • model_itr Model number associated with with the weight file (default=60000)
    • img_width Original image width (default=1920)
    • img_height Original image height (default=1080)
    • gpu GPU index used by ACT (default=0)
class act_json_writer.ACTJsonWriter(conf)[source]

Write object_track_set from ACT and detected_object_set from an object detector in NIST specified JSON format

  • Input Ports:
    • object_track_set Tracks obtained from ACT (Required)
    • timestamp Timestamp associated with the input from which tracks were obtained (Required)
    • file_name Name of the input source (Required)
    • detected_object_set Detections obtained from object detector
  • Output Ports:
    • None
  • Configuration:
    • exp Experiment configuration for ACT (Eg. exp.yml)
    • is_aod Flag to specify the task for which ACT is used (Default=False)
    • confidence_threshold Lower bound for confidence associated with an activity (Default=0.2)
    • json_path Path to json file produced by the writer (default=sysfile.json)
class act_visualizer.ACTVisualizer(conf)[source]

Render track_object_set from ACT and detected_object_set from object detector on an image

Note

The tracks visualized are intermediate tracks and not the final track.

  • Input Ports:
    • image Input RGB image used by ACT (Required)
    • timestamp Timestamp associated with the image (Required)
    • object_track_set Tracks obtained from ACT (Required)
    • detected_object_set Detections obtained from object detector (Optional)
  • Output Ports:
    • image Output image with bounding box representing spatial localization of an activity
  • Configuration:
    • exp Experiment configuration for ACT (Eg. exp.yml)
    • is_aod Flag to specify the task for which ACT is used (Default=False)
class modify_bbox_resolution.ModifyBboxResolution(conf)[source]

Modify the bounding box based on the difference between input and output image resolution

  • Input Ports:
    • detected_object_set Set of input detections (Required)
  • Output Ports:
    • detected_object_set Set of scaled detections (Optional)
  • Configuration:
    • input_image_width Width of the image from which input detections were generated (default=512)
    • input_image_height Height of the image from which input detections were generated (default=512)
    • output_image_width Width of the image that output detections would be scaled to (default=1920)
    • output_image_height Height of the image that output detections would be scaled to (default=1080)
class merge_tubes.MergeTubes(conf)[source]

Merge tubelets produced by ACT to create larger tubes.

  • Input Ports:
    • object_track_set Intermediate detections from ACT forward pass (Required)
    • file_name Input source (Required)
    • timestamp Timestamp assocaited with the input from which intermediated tracks were computed (Required)
  • Output Ports:
    • object_track_set Finished tubes
    • current_object_track_set Incomplete tubes
  • Configuration
    • exp Experiment configuration for ACT (Eg. exp.yml)
    • num_classes Number of classes that ACT was trained on (default=20)

[KWFS17]Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, and Cordelia Schmid. Action Tubelet Detector for Spatio-Temporal Action Localization. In ICCV 2017 - IEEE International Conference on Computer Vision. Venice, Italy, October 2017.