Description:ScopeThe client is seeking an engagement with an experienced candidate with qualifications in the listed technologies to aid the NST division and CNM User Facility in work to automate/standardize the process of collecting and storing data for AI training and collaborative ventures.ObjectivesThe overall goal of this assignment is to deliver a functional and impactful workflow to aid research scientists across the NST division with data management, including, but not limited to:Identifying location of data and the methodology to determine which data is to be managed through the workflowFiltering, sanitizing, normalizing and/or otherwise preparing the data for storage/sharingPackaging the data with AI-enabling metadata into an HDF5 format that can be used for long-term storageCreating a mechanism to migrate and inventory data for easy retrieval and/or sharingBuilding an interface/workflow that is accessible and easy to use by the research staffDevelop documentation of solutionsConstruct all components, artifacts, etc. using acceptable tools and techniques, good coding practices and accessible documentationTasks and DeliveryThree projects have been identified that require data manipulation, sanitization, orchestration and automation. During the first 2-3 weeks, engagements with the scientific domain experts will provide details on understanding their datasets, locations and desired outcomes.Once the details have been identified by the domain scientists, work will begin through an iterative development process to create a compatible workflow, using existing or new tools to achieve the desired outcomes. All tools, software and related materials will require compliance with local cybersecurity and IT standards as well as documentation of operation. Development under revision control is required.Delivery to the domain scientists of an MVP within the first 90-days of the project is expected. The MVP need not address all of the key deliverable outcomes but must demonstrate the core of the workflow desired.Further refinement, refactoring and feature addition is expected beyond the 90-day window.Once the domain scientists have accepted the solution, the candidate will be assigned a new set of domain experts and a new workflow to create. It is expected that each contractor will have no fewer than two active development tasks.Required Background, Knowledge, Skills and AbilitiesKnowledge/Skill/Ability:A strong working understanding of AI and LLMsAbility to develop efficient prompts and structured workflows with AI integrationFamiliarity with AI workflow orchestrators (e.x. Dify)Familiarity with relational databases (e.x. MySql/MariaDB, Oracle, etc.)Working knowledge of relevant programming/scripting languages (e.x. python) for automating tasksWorking knowledge of revision control systems (e.x. Git)Working knowledge of CI/CD pipelines as implemented in services like gitlab/github for testing and deploying solution changes automaticallyFamiliarity with RESTful web servicesStrong skills in Linux (Debian and RHEL) CLI administration, development and general useFamiliarity with GUI development either through web or python applicationsStrong written and oral communication skills providing the ability to develop requirements for tasks through interactions with technical experts in the fieldBackgroundAssociate or bachelor’s degree in computer science, information technology, system administration, or a closely related field, or equivalent experience required2 or more years of software development and/or system administrationNotes:Mostly off-site schedule (up to 80%) with 40 hours per weekVIVA is an equal opportunity employer. All qualified applicants have an equal opportunity for placement, and all employees have an equal opportunity to develop on the job. This means that VIVA will not discriminate against any employee or qualified applicant on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status
Description:ScopeThe client is seeking an engagement with an experienced candidate with qualifications in the listed technologies to aid the NST division and CNM User Facility in work to automate/standardize the process of collecting and storing data for AI training and collaborative ventures.ObjectivesThe overall goal of this assignment is to deliver a functional and impactful workflow to aid research scientists across the NST division with data management, including, but not limited to:Identifying location of data and the methodology to determine which data is to be managed through the workflowFiltering, sanitizing, normalizing and/or otherwise preparing the data for storage/sharingPackaging the data with AI-enabling metadata into an HDF5 format that can be used for long-term storageCreating a mechanism to migrate and inventory data for easy retrieval and/or sharingBuilding an interface/workflow that is accessible and easy to use by the research staffDevelop documentation of solutionsConstruct all components, artifacts, etc. using acceptable tools and techniques, good coding practices and accessible documentationTasks and DeliveryThree projects have been identified that require data manipulation, sanitization, orchestration and automation. During the first 2-3 weeks, engagements with the scientific domain experts will provide details on understanding their datasets, locations and desired outcomes.Once the details have been identified by the domain scientists, work will begin through an iterative development process to create a compatible workflow, using existing or new tools to achieve the desired outcomes. All tools, software and related materials will require compliance with local cybersecurity and IT standards as well as documentation of operation. Development under revision control is required.Delivery to the domain scientists of an MVP within the first 90-days of the project is expected. The MVP need not address all of the key deliverable outcomes but must demonstrate the core of the workflow desired.Further refinement, refactoring and feature addition is expected beyond the 90-day window.Once the domain scientists have accepted the solution, the candidate will be assigned a new set of domain experts and a new workflow to create. It is expected that each contractor will have no fewer than two active development tasks.Required Background, Knowledge, Skills and AbilitiesKnowledge/Skill/Ability:A strong working understanding of AI and LLMsAbility to develop efficient prompts and structured workflows with AI integrationFamiliarity with AI workflow orchestrators (e.x. Dify)Familiarity with relational databases (e.x. MySql/MariaDB, Oracle, etc.)Working knowledge of relevant programming/scripting languages (e.x. python) for automating tasksWorking knowledge of revision control systems (e.x. Git)Working knowledge of CI/CD pipelines as implemented in services like gitlab/github for testing and deploying solution changes automaticallyFamiliarity with RESTful web servicesStrong skills in Linux (Debian and RHEL) CLI administration, development and general useFamiliarity with GUI development either through web or python applicationsStrong written and oral communication skills providing the ability to develop requirements for tasks through interactions with technical experts in the fieldBackgroundAssociate or bachelor’s degree in computer science, information technology, system administration, or a closely related field, or equivalent experience required2 or more years of software development and/or system administrationNotes:Mostly off-site schedule (up to 80%) with 40 hours per week
(Please ensure email matches your resume email)
(document types allowed: doc/docx/rtf/pdf/txt) (max 20MB)
By submitting this form, you are consenting to the VIVA team contacting you via Phone/Email