Overall Process
<!— image: image (from original wiki uploads) —>
- (1) The project sponsor decides whether to host their data in the ACE infrastructure or to use a cloud environment hosted by another team.
- (2) If the project sponsor wants to use the ACE infrastructure, follow the process to request a new Workspace.
- (3a) The ACE Infra team creates the required AWS resources for the integration and works with Labelbox to set up delegated access. In addition, malware scanning tools are configured to scan the staging bucket.
- (3b) In parallel, the ACE PM team works with the project sponsor and the IMO team to ensure data privacy requirements are met before the data is copied into the ACE environment.
- (4) The project data custodian then copies the data to the designated staging S3 bucket using the AWS CLI. Once the data is in the staging bucket, an automated process scans the bucket for malware. If no malware is found, the data is synced with the Labelbox application bucket. If malware is discovered, the files are quarantined and admins are alerted.
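As a rough illustration of the routing in step (4), the sketch below models the decision made after a scan: clean files are marked for sync to the application bucket, and flagged files are marked for quarantine with an admin alert. Everything in it is hypothetical (the `ScanResult` type, the `route_scanned_files` function, and the file names), and it assumes routing happens per file. It does not describe the actual ACE automation or any particular scanning tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScanResult:
    """Hypothetical record of one scanned object in the staging bucket."""
    key: str
    infected: bool


def route_scanned_files(results):
    """Split scan results into keys to sync and keys to quarantine.

    Assumption: routing is decided per file. Returns a tuple of
    (keys_to_sync, keys_to_quarantine, alert_admins).
    """
    to_sync = [r.key for r in results if not r.infected]
    to_quarantine = [r.key for r in results if r.infected]
    # Admins are alerted only when at least one file was flagged.
    alert_admins = bool(to_quarantine)
    return to_sync, to_quarantine, alert_admins


# Hypothetical example input: one clean file and one flagged file.
example = [ScanResult("scan_001.png", False), ScanResult("scan_002.png", True)]
to_sync, to_quarantine, alert_admins = route_scanned_files(example)
```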
Requesting a New Labelbox Workspace
| :memo: | Workspaces is a feature that enables the creation of a logical separation to manage multiple instances of Labelbox with the same login. Each “Workspace” is essentially a separate instance of Labelbox with its own set of data rows, annotations, and users. |
|---|---|
To request a new Workspace, follow the instructions below.
- Go to the GitHub ACE Infra Labelbox Hybrid Workspace Request Template.
- Complete all of the inputs and be specific about which users need which roles. The required inputs are the Workspace name and the unix id for the Project Admin role. Without this role defined or the Workspace name identified, we cannot move forward with the creation of the Workspace.
- Next, send a Slack message in the Labelbox Slack channel or an email to support@labelbox.com and cc gred-ace-infra-d@gene.com, requesting the new Workspace and providing the Workspace name and the unix id of the Project Admin.
Upload Data for Use with ACE Labelbox
:warning: Data cannot be uploaded through the Labelbox web UI. Data must be copied to the delegated access S3 bucket. The process defined below applies only if the ACE Infra Team configured your Workspace delegated access for you. For information on how to link data from S3 buckets to Labelbox, see the wiki page on that topic.
Prerequisites
- The Workspace, with its staging and application buckets, must be configured
- The data onboarding process must be complete
Process
- Use the AWS CLI or the AWS Console to copy data from the source to the staging S3 bucket you have been given access to.
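For the AWS CLI route, a minimal sketch of how the copy command might be assembled is shown below. It only builds the argument list for `aws s3 sync` and prints it, so nothing is uploaded. The helper name `build_sync_command`, the local directory `./images`, the bucket name `example-staging-bucket`, and the prefix `batch-01` are all placeholders, not values from the ACE environment; use the staging bucket name you were given. The `--dryrun` flag asks the AWS CLI to list what it would transfer without copying anything.

```python
import shlex


def build_sync_command(source_dir, staging_bucket, prefix="", dry_run=True):
    """Build an `aws s3 sync` command as an argument list (nothing is executed)."""
    # Join bucket and optional prefix, avoiding a trailing slash when prefix is empty.
    destination = f"s3://{staging_bucket}/{prefix.strip('/')}".rstrip("/")
    command = ["aws", "s3", "sync", source_dir, destination]
    if dry_run:
        # --dryrun shows the planned operations without performing them.
        command.append("--dryrun")
    return command


# Placeholder values; substitute your own source directory and staging bucket.
cmd = build_sync_command("./images", "example-staging-bucket", prefix="batch-01")
print(shlex.join(cmd))
# prints: aws s3 sync ./images s3://example-staging-bucket/batch-01 --dryrun
```

Once the dry run lists the expected files, the same list built with `dry_run=False` could be passed to `subprocess.run(cmd, check=True)` to perform the copy, assuming the AWS CLI is installed and your credentials grant access to the staging bucket.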
How to Access ACE Labelbox
- In your web browser, go to https://app.labelbox.com/. You will see the following login prompt for the Labelbox application:
<!— image: image (from original wiki uploads) —>
- Enter your email address in the “your company email” field. Your email address must be from one of the Genentech / Roche domains:
gene.com, roche.com, businesspartner.roche.com, exbp.roche.com, or contractors.roche.com
NOTE: When you enter your email from an approved domain, the “password” field will disappear as part of the redirect to Roche SSO.
- After clicking the blue “LOG IN >” button, you will be directed to the Roche Single Sign-On (SSO) page to complete your authentication:
<!— image: image (from original wiki uploads) —>
- If this is your first time logging in, you will NOT have access to any projects by default. Please reach out to Miao Zhang or Kent Horvath to get added to the right project for ACE Projects. If you are accessing other Workspaces, please reach out to your Workspace contact to get added to the right project.
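For reference, the email-domain requirement from the login steps above can be expressed as a small check, for example to pre-validate a list of users before requesting access for them. This is only a sketch: the function name `has_approved_domain` is hypothetical, and exact, case-insensitive matching on the domain is an assumption, since how Labelbox itself evaluates the domain is not described here.

```python
# Approved Genentech / Roche email domains, as listed in the login steps.
APPROVED_DOMAINS = {
    "gene.com",
    "roche.com",
    "businesspartner.roche.com",
    "exbp.roche.com",
    "contractors.roche.com",
}


def has_approved_domain(email):
    """Return True if the part after the last '@' is an approved domain.

    Assumption: the domain must match exactly (case-insensitive), so
    subdomains that are not listed are rejected.
    """
    local, sep, domain = email.strip().rpartition("@")
    return bool(local) and bool(sep) and domain.lower() in APPROVED_DOMAINS
```

For instance, `has_approved_domain("jane.doe@gene.com")` evaluates to `True`, while an address at any other domain, or a string without an `@`, evaluates to `False`.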