This repository has been archived by the owner on Feb 22, 2024. It is now read-only.
Add unit tests for the generated APIs covering the many permutations of potential pipeline_api definitions. There are a lot of tests to write and permutations to cover. As such, multiple incremental PRs are strongly preferred over a mega PR.
Create a pipeline-test project as a fixture for testing
The root of the pipeline-test project is a barebones preprocessing pipeline family project. It should include a pipeline-notebooks directory with test pipeline notebooks, an empty prepline_test directory, and a preprocessing-pipeline-family.yaml file. The pipeline-test project may live under test_unstructured_api_tools/fixtures (or another reasonable place).
Each pipeline-notebook includes the definition for a pipeline_api, so there is a pipeline notebook for each of the following pipeline_api permutations to test:
def pipeline_api(text)
def pipeline_api(text, m_input1=[], m_input2=[])
def pipeline_api(text, response_type="text/csv")
def pipeline_api(text, response_type="application/json", response_schema="isd")
def pipeline_api(file)
def pipeline_api(file, response_type="text/csv", response_schema="isd")
def pipeline_api(file, file_content_type, response_type="application/json", response_schema="labelstudio", m_input1=[])
def pipeline_api(file, file_content_type, filename, response_type="application/json", response_schema="isd", m_input2=[], m_input1=[])
def pipeline_api(text, file, file_content_type, filename)
def pipeline_api(text, file, file_content_type, filename, response_type="application/json", m_input2=[])
def pipeline_api(text, file, file_content_type, filename, response_type="application/json", response_schema="isd")
def pipeline_api(text, file, file_content_type, filename, response_type="application/json", response_schema="isd", m_input1=[], m_input2=[])
The APIs should execute some trivial code to validate they are handling an uploaded file or text file appropriately, e.g., make sure the length of the content is as expected and/or that the first few characters make sense.
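As a sketch of what such trivial validation might look like (hypothetical body, not the real generated code), one notebook-defined function could be:

```python
# Hypothetical sketch of a trivial pipeline_api defined in a test pipeline
# notebook; it only sanity-checks the uploaded content rather than doing
# real processing. The return shape is an assumption for illustration.
def pipeline_api(text, m_input1=[], m_input2=[]):
    if len(text) == 0:
        raise ValueError("uploaded text file was empty")
    return {
        "length": len(text),      # content length is as expected
        "first_chars": text[:8],  # first few characters make sense
        "m_input1": m_input1,
        "m_input2": m_input2,
    }
```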
Test Cases Against the Generated FastAPI Routes
For each generated API, run a FastAPI TestClient and submit HTTP POST requests to cover a number of permutations, including:
single text file posted (e.g. the curl shorthand of -F text_files=@<file>)
multiple text files posted (e.g. -F text_files=@<file1> -F text_files=@<file2>)
Repeat the above two bullets, except for non-text files. In this case the form parameter is files rather than text_files
For all of the above, test with:
The curl equivalent of -F input1=mytestvalue
The curl equivalent of -F input2=val2
The curl equivalent of -F input1=anothervalue -F input2=atestvalue2
For "single text file posted" or "single non-text file posted" cases, test:
The curl equivalent of -H 'Accept: application/json'
And optionally -F output_schema=isd or -F output_schema=labelstudio
The curl equivalent of -H 'Accept: text/csv'
And optionally -F output_schema=isd
The curl equivalent of -H 'Accept: application/notsupported'
For "multiple text files posted" or "multiple non-text files posted" cases, test:
The curl equivalent of -H 'Accept: application/json'
where -F output_format=application/json, -F output_format=text/csv (an error), or output_format is not included
for valid cases in the above bullet, also test where the form parameter output_schema is not present, or has a value of -F output_schema=labelstudio, -F output_schema=isd, or -F output_schema=non-sensical
Same as the above except with -H 'Accept: multipart/mixed'
Unlike the above, however, -F output_format=text/csv is valid (text/csv per part)
Same as the above except with -H 'Accept: application/notsupported'
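The multi-file negotiation rules above could be sketched as a small helper. This is only an illustration of the expected behavior under test; the function name, error messages, and exact rules for non-sensical schemas are assumptions:

```python
from typing import Optional

# Hypothetical sketch of the Accept / output_format rules for multi-file
# requests described in the bullets above.
SUPPORTED_SCHEMAS = {"isd", "labelstudio"}

def negotiate(accept: str, output_format: Optional[str], output_schema: Optional[str]) -> str:
    """Return the response media type, or raise ValueError for an invalid combination."""
    if output_schema is not None and output_schema not in SUPPORTED_SCHEMAS:
        raise ValueError(f"unsupported output_schema: {output_schema}")
    if accept == "application/json":
        # A single JSON response cannot carry text/csv parts.
        if output_format == "text/csv":
            raise ValueError("output_format=text/csv is invalid with Accept: application/json")
        return "application/json"
    if accept == "multipart/mixed":
        # Each part may be JSON or CSV, so text/csv is valid here.
        return output_format or "application/json"
    raise ValueError(f"unsupported Accept header: {accept}")
```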
Linting checks
Finally, a test should run flake8 and mypy against the api/ modules to ensure that the library is generating clean code.
Definition of Done (Initial PR)
As mentioned, multiple PRs are preferred. The initial PR should at least cover a few of the pipeline_api(text, ... cases.
* Sets the pattern for the comprehensive unit tests in Issue #104.
* Adds a pipeline test project with a root directory of test_unstructured_api_tools/pipeline-test-project/
* Generates web APIs under test_unstructured_api_tools/pipeline-test-project/prepline_test_project/api/ from notebooks under test_unstructured_api_tools/pipeline-test-project/pipeline-notebook with make generate-test-api
* Ensures the generated APIs match expected in CI with make api-check-test
* Adds FastAPI client tests against the generated APIs in test_unstructured_api_tools/api/test_file_apis.py
There are still many notebooks to add under test_unstructured_api_tools/pipeline-test-project/pipeline-notebook and many more permutations to test per #104, but this establishes the pattern.