-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revit_Toolkit: remove whitespaces when matching names on Push #574
Comments
After a quick glance, this looks like a more convoluted problem than expected. Therefore pushing to 3.2, to be resolved together with #582 |
I could resolve it now already, but I started thinking of some more intelligent name matching than simply removal of whitespaces: it would be great if HEB200 could match not only with HEB 200 but also HE200B etc. I am curious if it would not be worth implementing a more intelligent string matching mechanism in general in order to handle typos or minor mismatches - this could be useful in other toolkits, but also e.g. in the method search. What do you think @al-fisher @IsakNaslundBh @FraserGreenroyd? |
At what point is I would agree with removing spaces at most. But I wouldn't agree with any other changes to the string - I would rather error out to the user and let them fix it to make sure they're getting the right workflow, and not the workflow we think might be right. |
HEB200 and HE200B are 2 commonly used names for the same thing. My gut feeling is we can find many more of such. |
Yes agreed this is worth putting some thought into generalising. I think the key here is to not "hard code" the matching assumptions, as for given work flows it will be really valuable to be able to override, add to or customise - what matches to what. I think we'll ultimately need a specific option for string comparing that allows user input. In fact think this is effectively a Comparer Config specific to String comparison. @alelom @pawelbaran We can then create very simple standard configs (combinations of settings) and/or datasets of common strings that are equivalent - to help the most common workflows |
This sounds like a Milestone workshop to me, to get others' thoughts too? |
Agreed |
Sounds good |
Agree with all the above. Also links in to BHoM/BHoM_Datasets#60 which is another place for the exact same issue of sections having slightly different names in slightly different context. There I had some idea of some hard-coded alternatives stored on the sections, but if we can fix it with some more cleaver string comparison matching, that would be even better. |
This issue seems relevant to the recent discussion on fuzzy string matching! |
Love it, thanks @vietle-bh! |
Description:
Atm the BHoM properties are being matched with Revit types based on names - sometimes this does not work due to whitespaces on either side (e.g. HEB200 vs HEB 200). Would be good to ignore these.
The text was updated successfully, but these errors were encountered: