-
Notifications
You must be signed in to change notification settings - Fork 149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: Read multiple worksheets (of same structure) at once #938
Comments
Hi Christian, there were multiple requests regarding reading multiple sheets from an Excel file. |
|
from implementation perspective I am wondering if this is the place to start:
"Just" loop over all matching sheets and return a single iterator as beofre. The only caveat I see is that on the 2nd sheet and later we need to skip the header line. Your thoughts on this? Or would you see a better approach? |
That looks like the correct entrypoint 👍 Then it would be a good time to think about testing: |
Am I using the newest version of the library?
Is there an existing issue for this?
Current Behavior
As of now I need to define the exact sheetName or provide a sheet index number to read data (via
dataAddress
).Expected Behavior
Idea is to provide a regex as
dataAddress
, e.g. "sheet_[0-9]+" and then all sheets that match the regex are read and returned as single Dataframe. All such sheets must match the same dataschema / structure. To mark the sheet name as regex we could surrounded the regex e.g. by "[...]" since [ is not a valid sheet name character).As an alternative one could implement a function to query all the sheet names, find the matching ones, then call the dataframereader for each sheet individually.
@nightscape , would that be a feature that fit spark-excel? If so, any thoughts about the implementation?
Best regards
Christian
Steps To Reproduce
No response
Environment
Anything else?
No response
The text was updated successfully, but these errors were encountered: