[opampsupervisor] configure agent healthcheck port #34643

dpaasman00 · 2024-08-13T11:30:22Z

Component(s)

cmd/opampsupervisor

Is your feature request related to a problem? Please describe.

I'd like to programmatically determine if the agent is healthy after being started by the supervisor without using an opamp connection to the supervisor. I want to do this by checking the agent's healthcheck extension endpoint. This would be particularly useful when updating an existing agent to be ran with the supervisor. Currently the port picked by the supervisor is non-deterministic making it difficult to determine which port should be targeted. I'd like to change this by making the healthcheck extension port configurable.

Describe the solution you'd like

The port assigned to the agent's healthcheck extension should be configurable in the supervisor config. A new parameter in the agent configuration section, maybe even in agent.description.

Describe alternatives you've considered

No response

Additional context

The relevant code for picking a random port.

The text was updated successfully, but these errors were encountered:

github-actions · 2024-08-13T11:30:38Z

Pinging code owners:

cmd/opampsupervisor: @evan-bradley @atoulme @tigrannajaryan @BinaryFissionGames

See Adding Labels via Comments if you do not have permissions to add labels yourself.

dpaasman00 · 2024-08-13T12:18:28Z

I'd like to take this on if this is wanted

tigrannajaryan · 2024-08-13T14:06:12Z

The port assigned to the agent's healthcheck extension should be configurable in the supervisor config.

Please add the motivation for this request to the description. It is not clear why this is needed. What problem is this solving? Why does the port need to be user selectable?

dpaasman00 · 2024-08-13T14:31:15Z

@tigrannajaryan Updated the description, let me know if I need to add additional context.

BinaryFissionGames · 2024-08-15T13:26:32Z

I think we want the port to be configurable regardless of actually checking it outside the supervisor, grabbing a random port is error-prone in that:

Between generating the port and binding to the port, a different process may bind to that port
The supervisor may choose a random port that conflicts with another application that starts after the supervisor.

These are both rare scenarios, but with enough time + users I could imagine these things happening more than once.

tigrannajaryan · 2024-08-15T14:59:52Z

Between generating the port and binding to the port, a different process may bind to that port

This indeed can happen. A possible fix is instead of Supervisor choosing a port, the healthcheck extension can choose a port and the opamp extension will report this port to Supervisor via effective config. Not entirely clear how opamp extension will learn about the port number though.

tigrannajaryan · 2024-08-15T17:01:38Z

@tigrannajaryan Updated the description, let me know if I need to add additional context.

Thanks, looks good.

**Description:** <Describe what has changed.>  Add a new configuration parameter to `agent` called `health_check_port`. If this is set, then the supervisor will configure the agent's healthcheck extension to use the given port. If it is unset, then we will grab a random port same as before. **Link to tracking Issue:** #34643 **Testing:** <Describe what testing was performed and which tests were added.> - Updated config validation tests - Verified that healthcheck extension is configured with the correct port and works as expected

…elemetry#34704) **Description:** <Describe what has changed.>  Add a new configuration parameter to `agent` called `health_check_port`. If this is set, then the supervisor will configure the agent's healthcheck extension to use the given port. If it is unset, then we will grab a random port same as before. **Link to tracking Issue:** open-telemetry#34643 **Testing:** <Describe what testing was performed and which tests were added.> - Updated config validation tests - Verified that healthcheck extension is configured with the correct port and works as expected

dpaasman00 added enhancement New feature or request needs triage New item requiring triage labels Aug 13, 2024

github-actions bot added the cmd/opampsupervisor label Aug 13, 2024

Frapschen assigned dpaasman00 Aug 15, 2024

Frapschen removed the needs triage New item requiring triage label Aug 15, 2024

dpaasman00 mentioned this issue Aug 15, 2024

[opampsupervisor] Add HealthCheckPort configuration parameter #34704

Merged

github-actions bot mentioned this issue Aug 20, 2024

Weekly Report: 2024-08-13 - 2024-08-20 #34743

Closed

dpaasman00 closed this as completed Sep 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[opampsupervisor] configure agent healthcheck port #34643

[opampsupervisor] configure agent healthcheck port #34643

dpaasman00 commented Aug 13, 2024 •

edited

Loading

github-actions bot commented Aug 13, 2024

dpaasman00 commented Aug 13, 2024

tigrannajaryan commented Aug 13, 2024

dpaasman00 commented Aug 13, 2024

BinaryFissionGames commented Aug 15, 2024

tigrannajaryan commented Aug 15, 2024

tigrannajaryan commented Aug 15, 2024

[opampsupervisor] configure agent healthcheck port #34643

[opampsupervisor] configure agent healthcheck port #34643

Comments

dpaasman00 commented Aug 13, 2024 • edited Loading

Component(s)

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

github-actions bot commented Aug 13, 2024

dpaasman00 commented Aug 13, 2024

tigrannajaryan commented Aug 13, 2024

dpaasman00 commented Aug 13, 2024

BinaryFissionGames commented Aug 15, 2024

tigrannajaryan commented Aug 15, 2024

tigrannajaryan commented Aug 15, 2024

dpaasman00 commented Aug 13, 2024 •

edited

Loading