
[ML] File data visualizer: filter running inference endpoints #196577

Open

wants to merge 12 commits into main
Conversation

jgowdyelastic
Member

@jgowdyelastic jgowdyelastic commented Oct 16, 2024

When listing the inference endpoints available for the semantic_text field, we should only list those whose underlying model is deployed; otherwise ingest can fail with a timeout while the model is being deployed and an ML node is spun up in the background.


This PR adds a check to the data_visualizer/inference_endpoints endpoint to ensure that only endpoints with the sparse_embedding or text_embedding task type are listed, and that they have at least one allocation.
Note: the allocation check is currently commented out, pending an Elasticsearch change (elastic/elasticsearch#115095).

It also renames the endpoint from data_visualizer/inference_services to data_visualizer/inference_endpoints, and renames variables that were incorrectly named "service" rather than "endpoint".
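
As an illustration only (not the PR's actual code), here is a minimal TypeScript sketch of the filtering described above. The endpoint shape (`inference_id`, `task_type`, `service_settings.num_allocations`) is an assumption based on the GET _inference/_all response, and the allocation check is left commented out to mirror the note above.

```ts
// Sketch only: field names are assumptions, not the PR's implementation.
interface InferenceEndpoint {
  inference_id: string;
  task_type: string;
  service: string;
  service_settings?: {
    num_allocations?: number;
  };
}

const EMBEDDING_TASK_TYPES = ['sparse_embedding', 'text_embedding'];

export function filterEmbeddingEndpoints(endpoints: InferenceEndpoint[]): InferenceEndpoint[] {
  return endpoints.filter((endpoint) => {
    // Only task types usable by the semantic_text field.
    if (!EMBEDDING_TASK_TYPES.includes(endpoint.task_type)) {
      return false;
    }
    // Allocation check, commented out until elastic/elasticsearch#115095 lands:
    // return (endpoint.service_settings?.num_allocations ?? 0) > 0;
    return true;
  });
}
```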

@jgowdyelastic jgowdyelastic self-assigned this Oct 17, 2024
@jgowdyelastic jgowdyelastic added the bug, :ml, release_note:skip, v9.0.0, v8.16.0, v8.17.0, Feature:File and Index Data Viz, and Feature:File Upload labels Oct 17, 2024
@jgowdyelastic jgowdyelastic marked this pull request as ready for review October 17, 2024 13:03
@jgowdyelastic jgowdyelastic requested a review from a team as a code owner October 17, 2024 13:03
@elasticmachine
Contributor

Pinging @elastic/ml-ui (:ml)

@jgowdyelastic jgowdyelastic changed the title [ML] File data visualizer filter running inference services [ML] File data visualizer: filter running inference endpoints Oct 17, 2024
@jgowdyelastic jgowdyelastic added the backport:version label Oct 17, 2024
@jeffvestal

It would also be nice to filter so that only inference APIs for embedding services are shown. However, this may be tricky with external services.


Contributor

@peteharverson peteharverson left a comment


Tested and LGTM.

One observation: in my local testing, uploading a PDF file (500 lines, 160KB) for the first time can take a couple of minutes, presumably because a new .elser-2 deployment is being created. During this time, the progress page stays on step 4 (Uploading data) with no visible sign that anything is happening. Is there anything we can do to improve the messaging here to provide more detail on what is happening?

@jgowdyelastic jgowdyelastic marked this pull request as draft October 18, 2024 10:01
@peteharverson peteharverson self-requested a review October 18, 2024 10:33
@jgowdyelastic
Member Author

@jeffvestal

It would also be nice to filter so that only inference APIs for embedding services are shown. However, this may be tricky with external services.

I'm now filtering for sparse_embedding and text_embedding types

@jeffvestal

I'm now filtering for sparse_embedding and text_embedding types

Nice! Thanks

@jgowdyelastic
Member Author

jgowdyelastic commented Oct 18, 2024

One observation: in my local testing, uploading a PDF file (500 lines, 160KB) for the first time can take a couple of minutes, presumably because a new .elser-2 deployment is being created. During this time, the progress page stays on step 4 (Uploading data) with no visible sign that anything is happening. Is there anything we can do to improve the messaging here to provide more detail on what is happening?

There will be a change going into Elasticsearch for 8.16 which fixes num_allocations. We can use that to ensure only deployed models are available for use, which means the pause you're seeing here for download/deployment will not happen.

In the future we could add a step that triggers this auto-deployment by calling the infer endpoint during upload, and reports the status of the download and deployment as an additional step in the upload progress. #196696
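
A hypothetical sketch of that follow-up idea: calling the public _inference API once up front so any model download/deployment is triggered before ingest starts. The helper name and the use of the raw transport (rather than a specific client helper) are assumptions, not code from this PR or #196696.

```ts
import type { Client } from '@elastic/elasticsearch';

// Hypothetical helper: hitting the inference endpoint once triggers any
// auto-deployment (model download + allocation) before the bulk ingest starts,
// so its progress could be surfaced as an extra step in the upload UI.
export async function triggerEndpointDeployment(
  esClient: Client,
  taskType: 'sparse_embedding' | 'text_embedding',
  inferenceId: string
): Promise<void> {
  await esClient.transport.request({
    method: 'POST',
    path: `/_inference/${taskType}/${inferenceId}`,
    // A trivial input; the response is discarded, only the deployment side effect matters.
    body: { input: ['warm up'] },
  });
}
```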

@jgowdyelastic jgowdyelastic marked this pull request as ready for review October 18, 2024 15:17
@elasticmachine
Contributor

💛 Build succeeded, but was flaky

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #60 / Rule execution logic API Detection Engine - Execution logic @ess @serverless Indicator match type rules, alert suppression Code execution path: events count is smaller than threats count should suppress an alert on real rule executions

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id              before    after     diff
dataVisualizer  614.0KB   614.0KB   +2.0B

History

cc @jgowdyelastic
