Normalize character strings for IPI-style matching
Normalizes UTF-8 character strings to a deterministic, ASCII-only, uppercase representation suitable for identifier-style matching and comparison (e.g. CISAC IPI-style name matching).
Details
The normalization:
transliterates accented Latin characters,
applies deterministic Cyrillic-to-Latin transliteration aligned with common CISAC / CMO practice,
removes punctuation and non-alphanumeric characters,
standardizes whitespace,
preserves pipe-separated name variants.
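The steps listed above can be illustrated with a minimal sketch. This is not the package's implementation: `normalize_sketch()`, `cyr_from` and `cyr_to` are hypothetical names, the Cyrillic table covers only a few letters for demonstration, and the choices to delete punctuation outright (rather than replace it with a space) and to trim each variant are assumptions. It relies on the stringi package for case mapping and the ICU `Latin-ASCII` transform.

```r
# Illustrative sketch only; assumes the stringi package is installed.
library(stringi)

# Hypothetical, deliberately tiny Cyrillic-to-Latin table (uppercase letters only).
# A real table would cover the full alphabet and follow the chosen convention.
cyr_from <- c("\u0410", "\u0412", "\u0416", "\u0418", "\u041D") # A, VE, ZHE, I, EN
cyr_to   <- c("A",      "V",      "ZH",     "I",      "N")

normalize_sketch <- function(x) {
  # Split on the pipe so each name variant is normalized separately
  # and the separator itself is never treated as punctuation.
  variants <- strsplit(x, "|", fixed = TRUE)
  vapply(variants, function(parts) {
    # Uppercase first so the uppercase-only Cyrillic table applies.
    parts <- stri_trans_toupper(parts, locale = "en_US")
    # Table-driven Cyrillic-to-Latin replacement, one letter at a time.
    for (i in seq_along(cyr_from)) {
      parts <- gsub(cyr_from[i], cyr_to[i], parts, fixed = TRUE)
    }
    # Strip diacritics from accented Latin characters.
    parts <- stri_trans_general(parts, "Latin-ASCII")
    # Drop everything that is not an ASCII letter, digit, or whitespace.
    # Note: Cyrillic letters missing from the tiny table are dropped here too.
    parts <- stri_replace_all_regex(parts, "[^A-Z0-9\\s]+", "")
    # Collapse whitespace runs to single spaces and trim the ends.
    parts <- stri_trim_both(stri_replace_all_regex(parts, "\\s+", " "))
    # Re-join the non-empty variants with the pipe separator.
    paste(parts[nzchar(parts)], collapse = "|")
  }, character(1))
}
```

Under these assumptions, `normalize_sketch("Jos\u00e9  O'Brien | \u0418\u0432\u0430\u043d")` returns `"JOSE OBRIEN|IVAN"`: the accent and apostrophe are removed, the doubled space is collapsed, and the two pipe-separated variants stay distinct.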
This function produces IPI-style normalized strings for internal matching and reconciliation. It does not generate official CISAC IPI Names and carries no CISAC or ISO authority.