Reference
Public API of tidysdmx — fetch SDMX schemas from FMR, infer schemas from tidy DataFrames, build structure maps from Excel templates, map data to dissemination schemas, validate datasets, and emit SDMX-ML artefacts ready for FMR upload.
FMR & Schemas
Fetch and parse SDMX schemas from a Fusion Metadata Registry.
- fetch_schema()
-
Fetch the schema of a specified artefact from an SDMX registry.
- fetch_dsd_schema()
-
Fetch a DSD schema from a Fusion Metadata Registry (FMR).
- parse_artefact_id()
-
Parse an artefact identifier into its components: agency, id and version.
- parse_dsd_id()
-
Parse a DSD identifier into its components.
- create_schema_from_table()
-
Create a DSD, ConceptScheme, and Codelists from a DataFrame.
Structure Maps
Build, parse, validate, and write SDMX structure maps.
- parse_mapping_template_wb()
-
Read an Excel mapping template and return all sheets as DataFrames.
- build_structure_map_from_template_wb()
-
Build a complete StructureMap object by parsing a WB-format Excel template.
- build_fixed_map()
-
Build a pysdmx FixedValueMap for setting a component to a fixed value.
- build_implicit_component_map()
-
Build a pysdmx ImplicitComponentMap for implicit mapping rules.
- build_date_pattern_map()
-
Build a DatePatternMap object for mapping date patterns between SDMX components.
- build_value_map()
-
Create a pysdmx ValueMap object mapping a source value to a target value.
- build_value_map_list()
-
Build a list of ValueMap objects from a pandas DataFrame, optionally including validity periods.
- build_multi_value_map_list()
-
Build a list of MultiValueMap objects from a pandas DataFrame.
- build_representation_map()
-
Build a RepresentationMap object from a pandas DataFrame using build_value_map_list.
- build_multi_representation_map()
-
Build a MultiRepresentationMap object from a pandas DataFrame.
- build_single_component_map()
-
Build a ComponentMap mapping one source component to one target component using a RepresentationMap built from a pandas DataFrame.
- collect_structure_map_artifacts()
-
Collect the StructureMap and all its dependent RepresentationMaps.
- validate_structure_map_references()
-
Validate that all RepresentationMap references are resolved.
- prepare_structure_map_for_upload()
-
Prepare a StructureMap for upload by collecting all dependencies.
Mapping
Apply structure maps to tidy DataFrames.
- map_structures()
-
Apply all mapping components from a StructureMap to a DataFrame.
- apply_fixed_value_maps()
-
Apply FixedValueMap rules to a DataFrame.
- apply_implicit_component_maps()
-
Apply ImplicitComponentMap rules to a DataFrame.
- apply_multi_component_map()
-
Apply a single MultiComponentMap with regex support, preserving rule order.
- map_to_sdmx()
-
Map DataFrame columns to SDMX values using a lookup mapping.
- transform_source_to_target()
-
Transform a raw DataFrame into the format defined by a components map.
Standardisation
Prepare a mapped DataFrame for SDMX upload.
- standardize_output()
-
Standardize the output DataFrame by adding SDMX reference columns.
- standardize_sdmx()
-
Standardize a DataFrame by applying column and value transformations.
- standardize_data_for_upload()
-
Standardize a DataFrame for SDMX upload.
- standardize_indicator_id()
-
Fix the INDICATOR column to be uppercase and prefixed with dataset ID.
- sanitize_variable()
-
Sanitize a raw string value into a valid SDMX code ID.
- add_sdmx_reference_cols()
-
Add SDMX reference columns to a DataFrame.
Validation
Validate datasets against schemas and codelists.
- validate_dataset_local()
-
Validate that a DataFrame is SDMX compliant and return a DataFrame of errors.
- validate_columns()
-
Validate that all DataFrame columns are valid components or SDMX references.
- validate_mandatory_columns()
-
Validate that all mandatory columns are present in the DataFrame.
- validate_codelist_ids()
-
Validate that all values in coded columns are within the allowed codelist IDs.
- validate_duplicates()
-
Validate that there are no duplicate rows for a given set of key columns.
- validate_no_missing_values()
-
Validate that there are no missing values in mandatory columns.
Tidy Raw
Filter and shape raw inputs.
- filter_tidy_raw()
-
Filter an SDMX DataFrame by removing rows that violate codelist constraints.
- filter_rows()
-
Filter out rows where values are not in the allowed codelist.
Utilities
Helpers for codelists, components, Excel templates, and XML.
- extract_validation_info()
-
Extract validation information from a given schema.
- get_codelist_ids()
-
Retrieve all codelist IDs for given coded components.
- extract_component_ids()
-
Retrieve all component IDs from a given pysdmx Schema.
- create_mapping_rules()
-
Create Excel hyperlink formulas for components with representation maps.
- build_excel_workbook()
-
Build a Workbook with component mapping and representation map sheets.
- write_excel_mapping_template()
-
Generate an Excel mapping template with component and representation tabs.
- read_mapping()
-
Read a JSON mapping file and parse its content into DataFrames.
- fix_sdmx_xml_datatype_tags()
-
Fix incorrect SourceCodelist/TargetCodelist tags in SDMX-ML.
- gen_urn()
-
Generate a full SDMX URN for any maintainable artefact.
QA
Quality-assurance helpers.
- qa_coerce_numeric()
-
Coerce specified columns to numeric, removing rows with invalid values.
- qa_remove_duplicates()
-
Remove duplicate rows from a DataFrame.
Kedro Integration
Kedro pipeline node wrappers.
- kd_read_mappings()
-
Fetch multiple mappings from different files.
- kd_standardize_sdmx()
-
Standardize a partitioned dataset into SDMX format.
- kd_validate_dataset_local()
-
Validate a single DataFrame for SDMX compliance.
- kd_validate_datasets_local()
-
Validate multiple datasets for SDMX compliance.
Lookups
Vectorised lookup helpers.
- vectorized_lookup_ordered_v1()
-
Apply ordered regex matching to a Pandas Series.
- vectorized_lookup_ordered_v2()
-
Apply ordered matching (regex or exact) to a Pandas Series.