Adopt microsecond precision in num2date and date2num #184

rcomer · 2021-06-30T17:55:20Z

🚀 Pull Request

Description

The _discard_microsecond and _num2date_to_nearest_second functions were introduced at #66 and ensure that we always work at second precision. The stated reason for this was

Allowing microsecond precision introduces rounding errors.

Since then, we have switched to using cftime under the hood (#85) and a quick search of the cftime repo suggests that they have worked on getting their functions working at microsecond precision (e.g. Unidata/cftime#187, Unidata/cftime#171). So it seems plausible that the rounding errors mentioned in #66 may no longer be a problem. This PR simply removes the microsecond handling from our functions to see which tests fail. The only failures are for tests that specifically use fractions of a second, and the values produced all look sensible to me:

Values produced within failing tests

test_discard_mircosecond

date = datetime.datetime(1970, 1, 1, 0, 0, 5, 750000)
cf_units.date2num(date, "seconds since 1970-01-01", "gregorian")

gives

5.75

test_fractional_second_*

for calendar in ['gregorian', '360_day', '365_day']:
    useconds = cf_units.Unit("seconds since 1970-01-01", calendar)
    print(useconds.num2date([0.25, 0.5, 0.75, 1.5, 2.5, 3.5, 4.5]))
    print('---')

gives

[cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 0, 250000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 0, 500000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 0, 750000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 1, 500000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 2, 500000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 3, 500000, has_year_zero=False)
 cftime.DatetimeGregorian(1970, 1, 1, 0, 0, 4, 500000, has_year_zero=False)]
---
[cftime.Datetime360Day(1970, 1, 1, 0, 0, 0, 250000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 0, 500000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 0, 750000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 1, 500000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 2, 500000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 3, 500000, has_year_zero=False)
 cftime.Datetime360Day(1970, 1, 1, 0, 0, 4, 500000, has_year_zero=False)]
---
[cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 0, 250000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 0, 500000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 0, 750000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 1, 500000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 2, 500000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 3, 500000, has_year_zero=True)
 cftime.DatetimeNoLeap(1970, 1, 1, 0, 0, 4, 500000, has_year_zero=True)]

So I can't see any evidence from the cf_units tests that rounding is still a problem.

If we were to strip out these functions and revert to cftime's microsecond precision, we would benefit from a simpler, easier to maintain codebase.

Thoughts?

bjlittle · 2023-04-19T09:32:50Z

@rcomer Let's rebase and respin.

I'm happy to then test this with iris to see if there are any side-effects.

Sound like a reasonable plan? 🤔

rcomer · 2023-04-19T10:22:46Z

Thanks @bjlittle I have rebased and also removed your retrospectively added tests - I forgot this branch predated those, but you just can't test what isn't there any more!

Remaining test failures appear to be as before i.e. expecting a whole number of seconds out when a fraction of seconds is passed in.

rcomer · 2023-04-19T14:11:08Z

I ran the Iris tests locally under py3.10 with this branch pip-installed. All passed ✅👍

bjlittle · 2023-04-19T15:31:11Z

@rcomer Okay, so this seems like a reasonable time to polish and bank this. I'm convinced that we should go for the removal of the rounding behaviour.

Since there is no downstream side-effects from testing with iris, that's enough for me.

I removed the decision required label, so go for it! 🚀

Simply show the PR some love, and extend the test coverage enough to convince ourselves that the cftime sub-second implementation is stable and I'll happily review and merge 👍

You happy to do that?

cf_units/tests/integration/test__Unit_date2num.py

rcomer · 2023-04-20T10:12:42Z

OK I think this might be there. I'm a bit unclear where the tests should be but they are easy enough to block move if needed. Can confirm that Iris tests still pass with latest version of this branch.

bjlittle

@rcomer Awesome refactor.

Thanks for having the patience of a saint 👼

rcomer marked this pull request as draft June 30, 2021 17:55

rcomer marked this pull request as ready for review July 7, 2021 21:35

rcomer marked this pull request as draft July 7, 2021 21:36

rcomer added Status: Decision Required Type: Tech Debt labels Apr 18, 2023

bjlittle self-assigned this Apr 19, 2023

rcomer added 2 commits April 19, 2023 10:41

comment out microsecond handling

3753a83

remove microsecond handling

ae2fe3b

rcomer force-pushed the strip-microsecond-handling branch from ce4717e to ae2fe3b Compare April 19, 2023 09:44

remove _discard_microsecond unit tests

563a66d

bjlittle self-requested a review April 19, 2023 15:21

bjlittle removed the Status: Decision Required label Apr 19, 2023

rcomer added 3 commits April 19, 2023 16:51

update expected values in test__num2date_to_nearest_second.py

0f81c73

update expected value in date2num test

2b389bf

test Unit.num2date instead of _num2date_to_nearest_second

7e67d48

rcomer force-pushed the strip-microsecond-handling branch from 2783cec to 6aed27e Compare April 19, 2023 16:59

rcomer added 6 commits April 20, 2023 09:53

purge _num2date_to_nearest_second

9479316

simplify Unit.num2date

3a16989

rename num2date test module

39ba128

add gregorian date2num fractional test

ee697b0

parametrize date2num fractional test

f10c9ab

test a longer time interval; put expected vals after input

3983b79

rcomer force-pushed the strip-microsecond-handling branch from 6aed27e to 3983b79 Compare April 20, 2023 10:03

rcomer changed the title ~~Discussion: consider adopting microsecond precision in num2date and date2num~~ Adopt microsecond precision in num2date and date2num Apr 20, 2023

rcomer commented Apr 20, 2023

View reviewed changes

cf_units/tests/integration/test__Unit_date2num.py Show resolved Hide resolved

rcomer marked this pull request as ready for review April 20, 2023 10:11

bjlittle approved these changes Apr 21, 2023

View reviewed changes

bjlittle merged commit b662175 into SciTools:main Apr 21, 2023

rcomer deleted the strip-microsecond-handling branch April 21, 2023 13:15

rcomer mentioned this pull request May 11, 2023

cftime warnings on pp-load SciTools/iris#5315

Closed

rcomer mentioned this pull request Nov 4, 2024

[CI Bot] environment lockfiles auto-update SciTools/iris#6215

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adopt microsecond precision in num2date and date2num #184

Adopt microsecond precision in num2date and date2num #184

rcomer commented Jun 30, 2021

bjlittle commented Apr 19, 2023

rcomer commented Apr 19, 2023

rcomer commented Apr 19, 2023

bjlittle commented Apr 19, 2023

rcomer commented Apr 20, 2023 •

edited

Loading

bjlittle left a comment

Adopt microsecond precision in num2date and date2num #184

Adopt microsecond precision in num2date and date2num #184

Conversation

rcomer commented Jun 30, 2021

🚀 Pull Request

Description

test_discard_mircosecond

test_fractional_second_*

bjlittle commented Apr 19, 2023

rcomer commented Apr 19, 2023

rcomer commented Apr 19, 2023

bjlittle commented Apr 19, 2023

rcomer commented Apr 20, 2023 • edited Loading

bjlittle left a comment

Choose a reason for hiding this comment

rcomer commented Apr 20, 2023 •

edited

Loading