ACE PlatformDR Checklist — Talend

Business Function

Talend (Vendor product) is the enterprise solution for all data integration projects within Roche.
Enterprise Cloud Data Integration (ECDI) team manages the Talend platform at Roche.

Genentech Clinical Operations Reporting (gCORE) is an Operation Data Warehouse used by Genentech Research and Early Development (gRED). gCORE uses Talend as data integration tool

Talend has two components.
I. Cloud Infrastructure (Administration of users, roles, projects and license etc)
II. Local Environment (Remote Engine, Talend Studio)

Local Environment is managed by gRED - ACE Infra Team and the scope of this document is for Local Environment.

There are four talend environments:

  • dev
  • qa
  • uat
  • prod

Talend Product Architecture

Talend Architecture

Reference: https://help.talend.com/r/en-US/Cloud/talend-cloud-getting-started/tic-architecture

Scope

The scope of this document is for prod running in the ACE AWS Account.

The regional scope of this Recovery Checklist is the us-west-2 region of AWS.

Out of Scope

  • Multi-region availability is out of scope per business requirements.

Key Information

  • Talend Remote Engine and Talend Runtime 64-bit Linux software is needed
  • One time creation of Environment,Workspace, Engine, Key creation needs to be done by ECDI Team on Talend Server

Recovery Checklist & Dependencies

The following tables provide the step-by-step recovery checklist that the ACE Infra team can use to guide them through the recovery steps for the Talend Environment.

StatusTask SummaryTask Details
completeTalend Remote Engine Software downloadDownload software to your desktop Remote Engine - Nexus URL
completeTalend Runtime Software downloadDownload software to your desktop Runtime - Nexus URL
completeTalend software download problemIf you are unable to download the software reach out to gloecdi_opsteam@msxdl.roche.com
completeType Unzip Software downloadUnzip Talend Remote Engine and Talend Runtime software downloaded
completeTalend software binary checkAbove unzip step is to make sure software binary is downloading successfully. It will not be used for installation
completeTalend software download problem from EC2Check Route 53 —> Resolver —> Rules [nexus-gtm-roche-com-rule-outbound & nexus-roche-com-rule-outbound] if talend download problem from EC2.More info is on https://github.com/gred-ecdi/terraform-ace-prod/blob/master/us-west-2/infra-route53-dns/main.tf
completeTalend License Remote Engine creation requestRaise ticket to create remote enginge and provide the Remote Engine key by raising ticket http://dpt-support.roche.com
completeTalend Key placementPlace the Talend key on https://github.com/gred-ecdi/terraform-ace-prod/blob/master/us-west-2/infra-talend-reprod/main.tf —> talend_key
completeTerraform initrun terraform init
completeTerraform planAfter successful run of terraform init, run terraform plan
completeTerraform applyAfter successful run of terraform plan, run terraform apply
completes3 bucketGet the s3 bucket name prefix from Terraform, it will be something like infra-talend-reprod
completeLog into AWSLog into the AWS Console using this link
completeNavigate to s3Navigate to s3 service and look for the bucket with s3 bucket name prefix which is captured above
completeCheck Talend installer log on s3Inside the s3 bucket look for a folder called stack-data, navigate inside that folder and then to the folder with EC2 instance name
completeValidate Log file on s3Open all the logs files under the bucket listed on the step above and make sure there is no errors on it
completeValidate through Talend StudioIf you have access to Talend Studio, logon to Talend studio and try running any sample job
completeAdd RE to ClusterRaise ticket to talend team http://dpt-support.roche.com to add remote engines created above to the ACE_DE_PROD Cluster
Common issue & Fixkms pull installationInstall kms pull manually; Logon to EC2; cd /talend_002/kmspull; ./install_kmspull_from_root -eetl_admin_prod infra-talend-reprod-1.gred.ai; chmod 755 /usr/local/bin/kmspull; Repeate this steps on all the servers

Troubleshooting Steps: I. Key update and Restart Remote Engine

  • Stop the engine: systemctl stop talend-remote-engine.service

  • Replace license key cd /opt/Talend/etc Open preauthorized.key.cfg file for editing and update remote.engine.pre.authorized.key parameter with the new key Save the file

  • Directory Cleanup rm -rf /opt/Talend/data/* Open /opt/Talend/etc/org.talend.ipaas.rt.pairing.agent.cfg file and remove value for remote.engine.id parameter. Save the file.

  • Start the engine systemctl start talend-remote-engine.service

  • Check Status systemctl status talend-remote-engine.service

II. ECD Talend Team Contact

III. Talend Error Fix

  • If Terraform Talend has any problem it is easier to get new key and spin up a new instance. It is much easier