azure.ai.evaluation.evaluate returning invalid json for groundedness_pro evaluation #39689

Open
soferreira opened this issue Feb 12, 2025 · 1 comment
Labels
  • bug: This issue requires a change to an existing behavior in the product in order to be resolved.
  • Client: This issue points to a problem in the data-plane of the library.
  • customer-reported: Issues that are reported by GitHub users external to the Azure organization.
  • Evaluation: Issues related to the client library for Azure AI Evaluation
  • needs-team-attention: Workflow: This issue needs attention from Azure service team or SDK team
  • Service Attention: Workflow: This issue is responsible by Azure service team.

Comments

@soferreira

  • Package Name:
    azure-ai-evaluation 1.2.0

  • Package Version:
    azure-ai-evaluation 1.2.0
    azure-ai-inference 1.0.0b8
    azure-ai-projects 1.0.0b5

  • Operating System:
    linux

  • Python Version:
    Python 3.10.8

Describe the bug

azure.ai.evaluation.evaluate is returning invalid JSON for some of the groundedness_pro evaluations.

To Reproduce
Steps to reproduce the behavior:

  1. Run evaluate with evaluator:
    results = evaluate(
        evaluation_name=f"Eval-Run-{randomNum}-{model}",
        data="input_data.jsonl",
        target=ModelEndpoints(model_endpoints, model),
        evaluators=evaluators,
        azure_ai_project=azure_ai_project,
        evaluator_config=evaluator_config,
        output_path="./myevalresults.json"
    )

Evaluators:

evaluators = {}
groundedness_pro_evaluator = GroundednessProEvaluator(azure_ai_project=azure_ai_project, credential=credential)
evaluators["groundedness_pro"] = groundedness_pro_evaluator

Column mapping with attributes from input_data.jsonl and the target response:

"column_mapping": {
    "context": "AML API only"",
    "response": "To provide you with the most accurate guidance, I need a bit more context about what you mean by \"create a run.\" The term \"run\" can be interpreted differently depending on the context. Here are a few possibilities:\n\n1. **Running as Exercise**: If you are referring to planning a running route for exercise, you can create a run by mapping out a route using apps like Google Maps, Strava, or MapMyRun. Choose a safe area, consider the distance you want to cover",
    "query": "How do you create a run?"",
},

Expected behavior
evaluate() should return valid JSON. Instead, it is returning invalid JSON.
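
One way to confirm and locate the problem is to try parsing the results file with Python's standard `json` module. The following is a minimal sketch, not part of the original report: the helper name `find_json_error` is hypothetical, and it assumes the results were written as a single JSON document to the path passed as `output_path` (`./myevalresults.json` above).

```python
import json
from pathlib import Path


def find_json_error(path):
    """Return None if the file parses as JSON, else a description of the first error.

    Hypothetical helper for illustration; it only uses the standard library.
    """
    text = Path(path).read_text(encoding="utf-8")
    try:
        json.loads(text)
    except json.JSONDecodeError as err:
        # Include some surrounding text so the offending value is easy to spot.
        start = max(err.pos - 40, 0)
        snippet = text[start:err.pos + 40]
        return f"{err.msg} at line {err.lineno}, column {err.colno}: {snippet!r}"
    return None
```

Calling `find_json_error("./myevalresults.json")` after the run would return `None` for a well-formed file, or a message with the position of the first parse error, which could be attached to this issue alongside the screenshot.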

Screenshots
[Image: screenshot attached to the issue]

@github-actions (bot) added the customer-reported, needs-triage, and question labels on Feb 12, 2025.
@xiangyan99 added the bug, Service Attention, and Evaluation labels and removed the question and needs-triage labels on Feb 12, 2025.
@github-actions (bot) added the needs-team-attention label on Feb 12, 2025.

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @luigiw @needuv @singankit.

@kristapratico added the Client label on Feb 12, 2025.
3 participants