
Add simplified integration framework #42450

Merged (7 commits into elastic:main from simplified-integration-framework) on Feb 11, 2025

Conversation

@rdner (Member) commented Jan 28, 2025

This framework can be used for running Beats binaries in integration tests.

It has a simplified API compared to the existing one and uses a more efficient way to search for logs in the command's output.

Checklist

  - [x] My code follows the style guidelines of this project
  - [x] I have commented my code, particularly in hard-to-understand areas
  - [ ] I have made corresponding changes to the documentation
  - [ ] I have made corresponding changes to the default configuration files
  - [x] I have added tests that prove my fix is effective or that my feature works
  - [ ] I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

Motivation

Recently I needed to write a few integration tests that run the Filebeat binary, and I found the existing integration framework very cumbersome. It requires a lot of boilerplate code, and the purpose of any framework is to eliminate that.

Since we're eventually going to migrate all the integration tests written in Python, we need a very simple tool for writing integration tests that covers most of the test cases.

Both frameworks (old and new) can co-exist for now.

Pros

This design has a few advantages over the previous one:

  • It does not use log files for inspecting/matching the expected logs. Instead, it connects directly to stdout/stderr and matches all the output expectations in memory, line by line, as they arrive. This makes it extremely efficient at expecting thousands of log lines (e.g. confirming that each line of a file gets ingested).
  • The test suite kills the process immediately once the defined expectations are met; no more polling with intervals.
  • It runs the binary that we ship to our customers instead of a custom binary (debatable, I know, but I think we should test what we ship).
  • It has a call-chain interface, which is more compact.
  • It supports testing cases where a Beat crashes with errors (not sure if the old framework supported that).
  • It has very detailed output for debugging a test failure.
  • It's generic and, in theory, can be used with any Beat, not just Filebeat (yet to be tested).
  • It can be extended and specialized for each Beat; see the example with Filebeat in this PR.

For example, a test similar to the one written in the older framework:

https://github.com/elastic/beats/blob/main/filebeat/tests/integration/filebeat_test.go#L37-L94

Can be replaced with this:

func TestFilebeat(t *testing.T) {
	messagePrefix := "sample text message"
	fileCount := 5
	lineCount := 128
	configTemplate := `
filebeat.inputs:
  - type: filestream
    id: "test-filestream"
    paths:
      - %s
# we want to check that all messages are ingested
# without using an external service, this is an easy way
output.console:
  enabled: true
`
	// we can generate any amount of expectations
	// they are light-weight
	expectIngestedFiles := func(test Test, files []string) {
		// ensuring we ingest every line from every file
		for _, filename := range files {
			for i := 1; i <= lineCount; i++ {
				line := fmt.Sprintf("%s %s:%d", messagePrefix, filepath.Base(filename), i)
				test.ExpectOutput(line)
			}
		}
	}

	ctx, cancel := context.WithTimeout(context.Background(), 30*time.Second)
	defer cancel()

	generator := NewJSONGenerator(messagePrefix)
	path, files := GenerateLogFiles(t, fileCount, lineCount, generator)
	config := fmt.Sprintf(configTemplate, path)
	test := NewTest(t, TestOptions{
		Config: config,
	})

	expectIngestedFiles(test, files)

	test.
		ExpectEOF(files...).
		ExpectStart().
		Start(ctx).
		Wait()
}

This test additionally validates that:

  • every file has been read until EOF
  • every line from each file was ingested

Another example; this time we expect the Beat to crash:

func TestFilebeat(t *testing.T) {
	ctx, cancel := context.WithTimeout(context.Background(), 10*time.Second)
	defer cancel()

	// path items are required, this config is invalid
	config := `
filebeat.inputs:
  - type: filestream
    id: "test-filestream"
output.console:
  enabled: true
`
	test := NewBeatTest(t, BeatTestOptions{
		Beatname: "filebeat",
		Config:   config,
	})

	test.
		ExpectStart().
		ExpectOutput("Exiting: Failed to start crawler: starting input failed: error while initializing input: no path is configured").
		ExpectStop(1).
		Start(ctx).
		Wait()
}

Cons

  • This framework does not have all the functionality of the previous one, only the essential functions that cover most of the test cases. It can be easily extended; every defined interface is an extension point.
  • It relies heavily on the terminal output of a Beat. We can configure both console logging and file logging if necessary, but that is outside the scope of this framework.

Current functionality

Basic Assertions

  • Assert an output line that contains a defined string
  • Assert a list of output lines in a defined order that contain a given list of strings
  • Assert an output line that matches a regular expression
  • Assert a list of output lines in a defined order that match a given list of regular expressions
  • Assert that the process started
  • Assert that the process exited by itself with a certain exit code

Filebeat-specific Assertions

  • Assert all files on the list have been read to EOF

Reporting

  • Print out all defined expectations of the test
  • Print the last N lines of the output

Config

  • Add extra arguments to the command that runs the binary
  • Pass a config file (e.g. filebeat.yml)

@rdner rdner self-assigned this Jan 28, 2025
@botelastic botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label Jan 28, 2025
@rdner rdner added the Team:Elastic-Agent-Data-Plane Label for the Agent Data Plane team label Jan 28, 2025
@botelastic botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label Jan 28, 2025
mergify bot (Contributor) commented Jan 28, 2025

This pull request does not have a backport label.
If this is a bug or security fix, could you label this PR @rdner? 🙏.
For such, you'll need to label your PR with:

  • The upcoming major version of the Elastic Stack
  • The upcoming minor version of the Elastic Stack (if you're not pushing a breaking change)

To fix up this pull request, you need to add the backport labels for the needed
branches, such as:

  • backport-8.\d is the label to automatically backport to the 8.\d branch, where \d is the digit

mergify bot (Contributor) commented Jan 28, 2025

backport-8.x has been added to help with the transition to the new branch 8.x.
If you don't need it, please add the backport-skip label and remove the backport-8.x label.

@mergify mergify bot added the backport-8.x Automated backport to the 8.x branch with mergify label Jan 28, 2025
@rdner rdner added backport-skip Skip notification from the automated backport with mergify and removed backport-8.x Automated backport to the 8.x branch with mergify labels Jan 28, 2025
@rdner rdner force-pushed the simplified-integration-framework branch 8 times, most recently from c9bcb84 to efd3b0d Compare January 28, 2025 17:00
@rdner rdner requested review from belimawr and leehinman January 28, 2025 17:00
@rdner rdner force-pushed the simplified-integration-framework branch 7 times, most recently from dafbfd4 to 125a560 Compare January 29, 2025 13:51
@rdner rdner marked this pull request as ready for review January 29, 2025 15:43
@rdner rdner requested a review from a team as a code owner January 29, 2025 15:43
@elasticmachine (Collaborator) commented:

Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane)

@rdner rdner requested a review from khushijain21 January 30, 2025 12:19
@rdner rdner requested a review from VihasMakwana January 30, 2025 12:19
@rdner rdner requested a review from VihasMakwana February 10, 2025 11:50
@belimawr (Contributor) left a comment:

Overall it's a very nice initiative. The only thing I'm missing is a README/documentation about which framework to use and the features this one has.

You wrote a very detailed PR description with examples, etc. That could become the README. What do you think?

Comment on lines +53 to +62
// NewTest creates a new integration test for Filebeat.
func NewTest(t *testing.T, opts TestOptions) Test {
	return &test{
		BeatTest: integration.NewBeatTest(t, integration.BeatTestOptions{
			Beatname: "filebeat",
			Config:   opts.Config,
			Args:     opts.Args,
		}),
	}
}
Contributor comment:
[Suggestion]
Either rename NewTest to something like NewFilebeatTest or add Beatname as an argument, so it's clear whether this function is specific to a Filebeat test or generic.

rdner (Member, Author) replied:

Since the full import path is already filebeat/testing/integration, it does not seem necessary to repeat that in the function name.

filebeat/testing/integration/log_generator.go (review thread outdated, resolved)
filebeat/testing/integration/log_generator.go (review thread outdated, resolved)
Comment on lines 132 to 133
cfgPath := filepath.Join(t.TempDir(), fmt.Sprintf("%s.yml", opts.Beatname))
homePath := filepath.Join(t.TempDir(), "home")
Contributor comment:

It's not a big deal, but each call to t.TempDir returns a different directory. Why not use a single directory?

Contributor comment:

On a different topic: t.TempDir is very convenient; however, when debugging failing (or flaky) tests I often find myself needing to look at the files the test is using. That's why our other integration test framework keeps the test folder on failure.

What do you think about implementing a similar logic here?

rdner (Member, Author) replied:

I can't come up with a test case where you would need to look at the generated configuration file or at the registry file. Logging goes to stderr instead.

If the test fails, it prints the last N lines of the output; this is configurable.

Everything should be observed through the output of the Beat. That's my idea of integration tests: we should test everything the way our customers run it. For example, if you want to check the stored offset in the state, it should be tested through ingested lines, not by inspecting the registry. The registry implementation should be covered by its unit tests instead.

Good catch with the temporary directory, though; I was not aware it returned a different directory every time.

Contributor comment:

I can't come up with a test case when you would need to look at the configuration file you generated and at the registry file.

It's not a test case; it's us (human engineers) debugging or writing a test, where something is not working as expected and we need to investigate manually rather than just running the tests.

At least for me, this is a very common workflow.

Anyway, it does not need to be implemented or discussed in depth in this PR. We can leave it as it is and discuss/iterate on it later.

Everything should be observed through the output of the Beat. That's my idea of integration tests, we should test everything like our customers run it.

I agree!

For example, if you want to see the stored offset in the state, it should be tested through ingested lines, not inspecting the registry. The registry implementation should be covered by its unit tests instead.

I disagree with that, because having every possible state we need to test appear in the logs can be very verbose and heavy. Anyway, that's not a discussion for now.

libbeat/testing/integration/sample_test.go (review thread outdated, resolved)
@rdner rdner force-pushed the simplified-integration-framework branch from 125a560 to 7ec435f Compare February 11, 2025 09:11
rdner (Member, Author) commented Feb 11, 2025

@belimawr I've added a README and addressed your feedback.

@rdner rdner requested a review from belimawr February 11, 2025 09:13
@rdner rdner force-pushed the simplified-integration-framework branch from 31bb31d to bb39651 Compare February 11, 2025 12:22
rdner (Member, Author) commented Feb 11, 2025

@AndersonQ suggested a good idea: always rebuild the binary when we run EnsureCompiled. I've implemented it with a hash check now, in commit bb39651.

filebeat/testing/integration/log_generator.go (review thread outdated, resolved)
libbeat/testing/integration/integration.go (review thread resolved)
libbeat/testing/integration/integration.go (review thread outdated, resolved)
@rdner rdner merged commit a6ab04f into elastic:main Feb 11, 2025
143 checks passed
@rdner rdner deleted the simplified-integration-framework branch February 11, 2025 18:17
@rdner rdner added backport-9.0 Automated backport to the 9.0 branch and removed backport-skip Skip notification from the automated backport with mergify labels Feb 12, 2025
mergify bot pushed a commit that referenced this pull request Feb 12, 2025
Add simplified integration framework


(cherry picked from commit a6ab04f)
rdner added a commit that referenced this pull request Feb 12, 2025
Add simplified integration framework


(cherry picked from commit a6ab04f)

Co-authored-by: Denis <[email protected]>