WIP: Reimplementing search_dates #945
gavishpoddar wants to merge 44 commits into scrapinghub:master
|
Hi, I need a suggestion: should I use the translated chunks or the original chunks to further parse the data objects? I currently use the translated chunks instead of the original ones, as this increased accuracy in some basic tests. Thanks, and please advise. |
search_dates & fixing search translation
Hi @gavishpoddar, using |
Codecov Report
Base: 98.23% // Head: 98.10% // Decreases project coverage by
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #945      +/-   ##
==========================================
- Coverage   98.23%   98.10%   -0.13%
==========================================
  Files         232      235       +3
  Lines        2604     2692      +88
==========================================
+ Hits         2558     2641      +83
- Misses         46       51       +5
|
Hi @noviluni, I have made the changes needed for the tests to work. Can you please approve the workflow run? |
|
@gavishpoddar workflows approved 👍 |
|
I need help with one test
|
|
Hi, after some changes the code is very much compatible with the old Currently, this PR is left with some docs changes (just replacing the current with new docs and |
search_dates & fixing search translation
|
Replaced previous |
lopuhin left a comment
Thanks @gavishpoddar, I left some comments regarding tests. Please let me know if you need advice on how to implement xfail.
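For reference, a minimal sketch of marking a known-failing test as an expected failure, assuming the test suite runs under pytest. `find_years` and both test names are hypothetical stand-ins for illustration, not code from this PR:

```python
import re

import pytest


def find_years(text):
    """Hypothetical stand-in for the function under test: finds 4-digit years."""
    return re.findall(r"\b\d{4}\b", text)


def test_finds_plain_year():
    # Supported case: runs and is reported as a normal pass.
    assert find_years("released in 2021") == ["2021"]


# Known limitation: pytest reports this test as "xfail" instead of a failure.
# strict=True turns an unexpected pass into a failure, so the marker
# has to be removed once the underlying bug is fixed.
@pytest.mark.xfail(reason="two-digit years are not supported yet", strict=True)
def test_finds_two_digit_year():
    assert find_years("back in '99") == ["99"]
```

The `reason` string shows up in the pytest report, which helps keep track of why a case is expected to fail.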
lopuhin left a comment
Thanks @gavishpoddar, a few docs suggestions.
Co-authored-by: Konstantin Lopuhin <kostia.lopuhin@gmail.com>
|
Hey, this PR is now updated with #932. |
|
Hi @gavishpoddar, I found your PR to improve the behavior of Is there anything I can do to help get this PR merged? |
def __init__(self):
    self.loader = LocaleDataLoader()
    self.available_language_map = self.loader.get_locale_map()
    self.search = _ExactLanguageSearch(self.loader)
The removal of DateSearchWithDetection.search is backward-incompatible.
I can create a shortcut for DateSearchWithDetection.search and add a deprecation warning, or simply rename it.
Please suggest a preferred action.
I think keeping the old objects around with their old names, logging a warning when used (i.e. by exposing them through a property whose getter logs a warning), would be ideal.
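The property-based approach suggested above could look roughly like the sketch below. It assumes the new implementation is stored under a private `_search` attribute (a hypothetical name) and uses the standard `warnings` module with `DeprecationWarning` rather than a logger. `_ExactLanguageSearch` here is a simplified stand-in, not the real class:

```python
import warnings


class _ExactLanguageSearch:
    """Simplified stand-in for the real search helper."""

    def __init__(self, loader):
        self.loader = loader


class DateSearchWithDetection:
    def __init__(self, loader=None):
        self.loader = loader
        # Hypothetical private name for the object formerly exposed as `search`.
        self._search = _ExactLanguageSearch(self.loader)

    @property
    def search(self):
        # The old public name keeps working, but every access emits a warning.
        warnings.warn(
            "DateSearchWithDetection.search is deprecated and may be removed "
            "in a future release.",
            DeprecationWarning,
            stacklevel=2,  # attribute the warning to the caller's line
        )
        return self._search
```

Existing callers that read `DateSearchWithDetection().search` keep getting a working object, while the warning gives them time to migrate before the attribute is removed.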
Reimplementing and simplifying search_dates
A reimplemented and simplified search_dates which more directly uses dateparser.parse, improves accuracy and fixes many bugs.
New Feature: search_first_date - searches and returns the first date from the given text.
NOTE: This PR is inspired by the previous implementation of search_dates and #931.
TODO
DATE_ORDER