Crates.io | ixdtf |
lib.rs | ixdtf |
version | 0.2.0 |
source | src |
created_at | 2023-01-26 18:19:40.823858 |
updated_at | 2024-05-28 20:25:01.850849 |
description | Parser for Internet eXtended DateTime Format |
homepage | https://icu4x.unicode.org |
repository | https://github.com/unicode-org/icu4x |
max_upload_size | |
id | 768766 |
size | 114,682 |
Parsers for extended date time string and Duration parsing.
The Internet Extended Date/Time Fmt (IXDTF) is laid out by RFC 9557. RFC 9557 builds on RFC3339's time stamp specification and ISO8601 to provide an optional extension syntax for date/time strings. RFC 9557 also updates RFC3339 "in the specific interpretation of the local offset Z".
2024-03-02T08:48:00-05:00[America/New_York]
2024-03-02T08:48:00-05:00[-05:00]
2024-03-02T08:48:00-05:00[u-ca=iso8601]
use ixdtf::parsers::{
records::{Sign, TimeZoneRecord},
IxdtfParser,
};
let ixdtf_str = "2024-03-02T08:48:00-05:00[America/New_York]";
let result = IxdtfParser::new(ixdtf_str).parse().unwrap();
let date = result.date.unwrap();
let time = result.time.unwrap();
let offset = result.offset.unwrap();
let tz_annotation = result.tz.unwrap();
assert_eq!(date.year, 2024);
assert_eq!(date.month, 3);
assert_eq!(date.day, 2);
assert_eq!(time.hour, 8);
assert_eq!(time.minute, 48);
assert_eq!(offset.sign, Sign::Negative);
assert_eq!(offset.hour, 5);
assert_eq!(offset.minute, 0);
assert!(!tz_annotation.critical);
assert_eq!(tz_annotation.tz, TimeZoneRecord::Name("America/New_York"));
The extended suffixes laid out by RFC 9557 are optional, so the IxdtfParser
will also still parse any valid date time strings described by RFC3339.
Example Valid Date Time Strings:
2024-03-02
+002024-03-02
20240302
+0020240302
2024-03-02T08:48:00
2024-03-02T08:48:00
The suffix extensions come in two primary kinds: a time zone annotation and a key-value
annotation. The suffixes may also be flagged as critical with a !
as a leading flag
character.
Time zone annotations can be either a valid IANA time zone name or numeric offset.
2024-03-02T08:48:00-5:00[America/New_York]
2024-03-02T08:48:00-5:00[-05:00]
Key-value pair annotations are any key and value string separated by a '=' character.
Key-value pairs are can include any information. Keys may be permanently registered,
provisionally registered, or unknown; however, only permanent keys are acted on by
IxdtfParser
.
If duplicate registered keys are provided the first key will be returned, unless one
of the duplicate annotations is marked as critical, in which case an error may be
thrown by the ixdtf
(See Invalid Annotations for more
information).
u-ca
2024-03-02T08:48:00-05:00[America/New_York][u-ca=iso8601]
2024-03-02T08:48:00-05:00[u-ca=iso8601][u-ca=japanese]
2024-03-02T08:48:00-05:00[u-ca=iso8601][!u-ca=iso8601]
2024-03-02T08:48:00-05:00[u-ca=iso8601][answer-to-universe=fortytwo]
This is a basic annotation string that has a Time Zone and calendar annotation.
This example is duplicate and different calendar annotations, but neither calendar is flagged as critical so the first calendar is returned while the second calendar is ignored.
This example is a duplicate and identical calendar annotations with one annotation flagged as critical. As the annotations are identical values, there is no ambiguity with the use of the critical flag that may cause an error. Thus, the first annotation is returned, and the second is ignored (See Annotations with Application Defined Behavior).
This example contains an unknown annotation. The annotation is not marked as critical so the value is ignored (See Implementing Annotation Handlers).
The below ixdtf
strings have invalid annotations that will cause an error
to be thrown (NOTE: these are not to be confused with potentially invalid
annotations with application defined behavior).
2024-03-02T08:48:00-05:00[u-ca=iso8601][America/New_York]
2024-03-02T08:48:00-05:00[u-ca=iso8601][!u-ca=japanese]
2024-03-02T08:48:00-05:00[u-ca=iso8601][!answer-to-universe=fortytwo]
This example shows a Time Zone annotation that is not currently in the correct
order with the key value. When parsing this invalid annotation, ixdtf
will attempt to parse the Time Zone annotation as a key-value annotation.
use ixdtf::{parsers::IxdtfParser, ParserError};
let example_one =
"2024-03-02T08:48:00-05:00[u-ca=iso8601][America/New_York]";
let mut ixdtf = IxdtfParser::new(example_one);
let result = ixdtf.parse();
assert_eq!(result, Err(ParserError::AnnotationKeyLeadingChar));
This example shows a duplicate registered key; however, in this case, one of the registered keys is flagged as critical, which throws an error as the ixdtf string must be treated as erroneous
use ixdtf::{parsers::IxdtfParser, ParserError};
let example_one = "2024-03-02T08:48:00-05:00[u-ca=iso8601][!u-ca=japanese]";
let result = IxdtfParser::new(example_one).parse();
assert_eq!(result, Err(ParserError::CriticalDuplicateCalendar));
This example shows an unknown key flagged as critical. ixdtf
will return an
error on an unknown flag being flagged as critical.
use ixdtf::{parsers::IxdtfParser, ParserError};
let example_one =
"2024-03-02T08:48:00-05:00[u-ca=iso8601][!answer-to-universe=fortytwo]";
let result = IxdtfParser::new(example_one).parse();
assert_eq!(result, Err(ParserError::UnrecognizedCritical));
The below options may be viewed as valid or invalid depending on application defined
behavior. Where user defined behavior might be required, the ixdtf
crate applies
the logic in the least restrictive interpretation and provides optional callbacks
for the user to define stricter behavior.
2024-03-02T08:48:00-05:00[u-ca=japanese][!u-ca=japanese]
2024-03-02T08:48:00+01:00[America/New_York]
This example shows a critical duplicate calendar where the annotation value is identical. RFC 9557 is
ambiguous on whether this should be rejected for inconsistency. ixdtf
treats these values
as consistent, and, therefore, okay. However, an application may wish to handle this duplicate
critical calendar value as inconsistent (See Implementing Annotation Handlers).
This example shows an ambiguous Time Zone caused by a misalignment of the offset and the Time Zone annotation. It is up to the user to handle this ambiguity between the offset and annotation.
use ixdtf::parsers::{IxdtfParser, records::TimeZoneRecord};
let example_one = "2024-03-02T08:48:00+01:00[!America/New_York]";
let mut ixdtf = IxdtfParser::new(example_one);
let result = ixdtf.parse().unwrap();
let tz_annotation = result.tz.unwrap();
let offset = result.offset.unwrap();
// The time zone annotation and offset conflict with each other, and must therefore be
// resolved by the user.
assert!(tz_annotation.critical);
assert_eq!(tz_annotation.tz, TimeZoneRecord::Name("America/New_York"));
assert_eq!(offset.hour, 1);
As mentioned in the prior section, there may be times where an application may
need to implement application defined behavior for user defined functionality.
In this instance, ixdtf
provides a *_with_annotation_handler
method that
allows to the user to provide a callback.
A handler is defined as handler: impl FnMut(Annotation<'a>) -> Option<Annotation<'a>>
where ixdtf
provides visibility to an annotation to the user. The call to this callback
occurs prior to the ixdtf
's processing of the annotation, and will only occur if the
annotation is provided back to ixdtf
.
If the user wishes to ignore any ixdtf
's errors, then they may return None
, which
results in a no-op for that annotation.
Unless the user’s application has a specific reason to bypass action on an annotation, such as, custom unknown key handling or superceding a calendar based on it’s critical flag, it is recommended to return the annotation value.
A user may wish to implement a custom key in an annotation set. This can be completed with custom handler.
use ixdtf::parsers::IxdtfParser;
let example_with_custom_key = "2024-03-02T08:48:00-05:00[u-ca=iso8601][!answer-to-universe=fortytwo]";
let mut answer = None;
let _ = IxdtfParser::new(example_with_custom_key).parse_with_annotation_handler(|annotation| {
if annotation.key == "answer-to-universe" {
answer.get_or_insert(annotation);
// Found our value! We don't need `ixdtf` to handle this annotation.
return None
}
// The annotation is not our custom annotation, so we return
// the value back for regular logic.
Some(annotation)
}).unwrap();
let answer = answer.unwrap();
assert!(answer.critical);
assert_eq!(answer.value, "fortytwo");
It is worth noting that in the above example the annotation above found is a critically flagged
unknown key. RFC 9557 and ixdtf
considers unknown critical keys as invalid. However, handlers
allow the user to define any known keys of their own and therefore also handle the logic around
criticality.
Additional resources for Date and Time string grammar can be found in RFC3339 and the Temporal proposal.
The ixdtf
crate also implements an ISO8601 Duration parser (IsoDurationParser
) that is available under
the duration
feature flag. The API for IsoDurationParser
is the same as IxdtfParser
, but
parses duration strings over date/time strings.
For more information on development, authorship, contributing etc. please visit ICU4X home page
.