recently I've been asked by some xlocate 1 user who started to migrate to the newer xlocate 2 API about the handling of scores within the geocoding (multifield): he was used to interpret the xlocate 1 total score in a specific manner:
- During the automatic process of geocoding he distinguished between
- "Addresses with a first hit with a total score > xx%" : these addresses are flagged "geocoded"
- "The others" (<=xx% or no hits at all") : these addresses are flagged "to be revised manually"
- Addresses of the second category will then be checked by a user before they can be used in the succeeding processes.
- Some addresses decreased the total score from 100% to just 70%: they used to be "geocoded" automatically but need the manual check now. This means:
- Much more manual revision
- In roughly 9 out of 10 cases the confirmation through the user is simply to accept hit number 1
- The underlying data structures are different between xLocate 1 (accesses the map data) and xlocate 2 (requires a special index directory structure)
- The engine's search is different
- The metric of the score computation has changed and xLocate2 returns also the field scores
- Threshold values for the distinguishing process may have to be aligned
- You may have to include the field scores into your process. As the interpretation of these values may depend on your business there's no generic approach:
- Some users may just have a focus on postalCode/city name with a low priority on street/housenumber
- Others would need a proper confirmation of the street and housenumber as well.
Bernd