
[ML] File upload api refactor #210865

Merged
merged 23 commits into from
Feb 26, 2025

Conversation

jgowdyelastic
Member

@jgowdyelastic jgowdyelastic commented Feb 12, 2025

Adds a v2 version of the file upload API which splits the upload initialisation step out from the data upload API.
Previously the import data API behaved differently depending on whether an ID was passed to it. If an ID was not present, the API would "initialize the upload" by creating the index, mappings and pipeline.
Subsequent calls to the API would then pass in an ID as well as the data. The presence of the ID meant the data would be ingested.
The ID had no purpose other than signifying whether this was the initial call to create the index or a subsequent call to ingest the data.
This change adds a new `initialize_import` API which is called first to create the index et al.
Subsequent calls to the `import` API behave as before and the data is ingested.

A temporary v1 version of the `import` API has been kept for backwards compatibility during upgrades.

The `initialize_import` API also creates multiple ingest pipelines by default, improving on the previous "hacked in" approach of passing two sets of pipelines to provide backwards compatibility.

@jgowdyelastic jgowdyelastic self-assigned this Feb 17, 2025
@jgowdyelastic jgowdyelastic added :ml release_note:skip Skip the PR/issue when compiling release notes Feature:File and Index Data Viz ML file and index data visualizer Feature:File Upload v9.1.0 v8.19.0 labels Feb 17, 2025
@jgowdyelastic jgowdyelastic marked this pull request as ready for review February 17, 2025 20:54
@jgowdyelastic jgowdyelastic requested review from a team as code owners February 17, 2025 20:55
@elasticmachine
Contributor

Pinging @elastic/ml-ui (:ml)

@jgowdyelastic jgowdyelastic removed the request for review from nreese February 17, 2025 20:55
@jgowdyelastic jgowdyelastic added the backport:version Backport to applied version labels label Feb 18, 2025
IndicesIndexSettings,
MappingTypeMapping,
} from '@elastic/elasticsearch/lib/api/typesWithBodyKey';
import {
Member

We can use import type here and above

Member Author

Good spot, I'm too used to the consistent-type-imports eslint rule we have in our other plugins.
I think I'll add that rule to the file_upload plugin too in a separate PR.

Member Author

Updated in 3b340af


@@ -13,56 +13,72 @@ import type {
MappingTypeMapping,
} from '@elastic/elasticsearch/lib/api/typesWithBodyKey';
import { INDEX_META_DATA_CREATED_BY } from '../common/constants';
import { ImportResponse, ImportFailure, InputData, IngestPipelineWrapper } from '../common/types';
import {
Member

We can use import type here

Member Author

Updated in 3b340af

settings: schema.maybe(schema.any()),
/** Mappings */
mappings: schema.any(),
/** Ingest pipeline definition */
ingestPipeline,
createPipelines: schema.maybe(schema.arrayOf(ingestPipeline)),
ingestPipelines: schema.arrayOf(ingestPipeline),
Member

Would ingestPipelines ever be a schema.maybe(schema.arrayOf(ingestPipeline))? Or is it always empty array if there's no ingest pipeline?

Member Author

The latter. It should always be present and just empty if there are no pipelines to create.

Member

@qn895 qn895 left a comment

Tested and LGTM 🎉

@elasticmachine
Contributor

💚 Build Succeeded

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

| id | before | after | diff |
| --- | --- | --- | --- |
| fileUpload | 317 | 318 | +1 |

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run `node scripts/build_api_docs --plugin [yourplugin] --stats comments` for more detailed information.

| id | before | after | diff |
| --- | --- | --- | --- |
| fileUpload | 96 | 93 | -3 |

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

| id | before | after | diff |
| --- | --- | --- | --- |
| dataVisualizer | 616.1KB | 616.1KB | +3.0B |
| fileUpload | 644.7KB | 644.6KB | -96.0B |
| total | | | -93.0B |

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

| id | before | after | diff |
| --- | --- | --- | --- |
| fileUpload | 14.9KB | 15.0KB | +145.0B |

Unknown metric groups

API count

| id | before | after | diff |
| --- | --- | --- | --- |
| fileUpload | 96 | 93 | -3 |

History

cc @jgowdyelastic

@jgowdyelastic jgowdyelastic merged commit 0121f4b into elastic:main Feb 26, 2025
10 checks passed
@kibanamachine
Contributor

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/13541986386

@kibanamachine
Contributor

💔 All backports failed

| Status | Branch | Result |
| --- | --- | --- |
| | 8.x | Backport failed because of merge conflicts |

Manual backport

To create the backport manually run:

node scripts/backport --pr 210865

Questions ?

Please refer to the Backport tool documentation

@jgowdyelastic
Member Author

💚 All backports created successfully

Status Branch Result
8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

jgowdyelastic added a commit to jgowdyelastic/kibana that referenced this pull request Feb 26, 2025
(cherry picked from commit 0121f4b)
jgowdyelastic added a commit that referenced this pull request Feb 26, 2025
# Backport

This will backport the following commits from `main` to `8.x`:
- [[ML] File upload api refactor (#210865)](#210865)


### Questions ?
Please refer to the [Backport tool documentation](https://github.com/sorenlouv/backport)
