
$0-$0 / yr
Salary
colombia
Region
ASAP
Start Date
Gramian Consultancy brings together the perspective of a software engineer, the knowledge of a technical recruiter, and the vision of a business builder. This unique experience is our signature advantage to delivering top quality services in the domain of recruiting, staff augmentation, and outsourcing.
About Us
Gramian Consultancy is a boutique consultancy specializing in IT professional services and engineering talent solutions. With a strong background in software engineering and leadership, we help companies build high-performing teams by matching them with professionals who truly fit their needs.
Role Overview
We are looking for Computer Science engineers with PhD diploma to contribute to advanced AI model evaluation projects focused on reasoning-heavy computer science challenges. In this role, you will help assess and improve frontier AI systems by creating complex conceptual problems and evaluating the quality of AI-generated reasoning across multiple CS domains.
You will work on topics such as algorithms, systems design, databases, cybersecurity, computer networks, distributed systems, and other advanced undergraduate to PhD-level computer science areas. The ideal candidate combines strong theoretical foundations with the ability to critically analyze technical reasoning, edge cases, assumptions, and solution correctness.
CONTRACT: Contractor assignment (12-week project), paid per completed task
COMMITMENT: 20-40 hours/week with minimum 4 hours PST overlap
LOCATION: Remote — Bangladesh, Brazil, Colombia, Egypt, Ghana, India, Pakistan, Indonesia, Kenya, Nigeria, Turkey, Vietnam
PROCESS: 1 Online Test + 1 Interview
NOTE: Accepted candidates need to go through criminal background check before starting work.
Responsibilities
Design advanced computer science problems focused on reasoning and conceptual understanding
Create structured reference solutions with clear logic and technical explanations
Evaluate AI-generated answers for correctness, reasoning quality, completeness, and clarity
Identify edge cases, logical flaws, and weak reasoning patterns in model outputs
Review complex topics across algorithms, systems, databases, networking, security, and theory
Contribute to benchmark and evaluation quality improvements
Provide detailed feedback to support AI model training and evaluation
Collaborate with researchers and reviewers on problem refinement
Maintain high-quality standards and consistent evaluation methodology
Requirements
PhD in Computer Science or a closely related field (completed or in progress)
Strong theoretical and problem-solving background across advanced CS domains
Ability to design conceptual and reasoning-heavy technical problems
Strong understanding of algorithms, systems, databases, networking, or cybersecurity
Excellent written English communication skills
Strong analytical skills and attention to technical detail
Ability to critically evaluate reasoning, assumptions, and solution validity