Package DateHandler :: Module _Date_de :: Class DateParserDE
[frames] | no frames]

Class DateParserDE

source code


Instance Methods
 
init_strings(self)
This method compiles regular expression strings for matching dates.
source code

Inherited from _DateParser.DateParser: __init__, invert_year, match_bce, match_calendar, match_modifier, match_quality, match_range, match_span, parse, re_longest_first, set_date

Class Variables
  month_to_int = {u'10ber': 12, u'10bre': 12, u'10bris': 12, u'7...
  modifier_to_int = {u'ca.': 3, u'circa': 3, u'etwa': 3, u'gegen...
  calendar_to_int = {u'franz.': 3, u'französisch republikanisch'...
  quality_to_int = {u'ber.': 2, u'berechnet': 2, u'errechnet': 2...
  bce = ['vor unserer Zeitrechnung', 'vor unserer Zeit', 'vor de...

Inherited from _DateParser.DateParser: french_to_int, hebrew_to_int, islamic_to_int, modifier_after_to_int, persian_to_int

Method Details

init_strings(self)

source code 

This method compiles regular expression strings for matching dates.

Most of the re's in most languages can stay as is. span and range most likely will need to change. Whatever change is done, this method may be called first as DateParser.init_strings(self) so that the invariant expresions don't need to be repeteadly coded. All differences can be coded after DateParser.init_strings(self) call, that way they override stuff from this method. See DateParserRU() as an example.

Overrides: _DateParser.DateParser.init_strings
(inherited documentation)

Class Variable Details

month_to_int

Value:
{u'10ber': 12,
 u'10bre': 12,
 u'10bris': 12,
 u'7ber': 9,
 u'7bre': 9,
 u'7bris': 9,
 u'8ber': 10,
 u'8bre': 10,
...

modifier_to_int

Value:
{u'ca.': 3,
 u'circa': 3,
 u'etwa': 3,
 u'gegen': 3,
 u'nach': 2,
 u'um': 3,
 u'vor': 1}

calendar_to_int

Value:
{u'franz.': 3,
 u'französisch republikanisch': 3,
 u'greg.': 0,
 u'gregorianisch': 0,
 u'hebr.': 2,
 u'hebräisch': 2,
 u'isl.': 5,
 u'islamisch': 5,
...

quality_to_int

Value:
{u'ber.': 2,
 u'berechnet': 2,
 u'errechnet': 2,
 u'gesch.': 1,
 u'geschätzt': 1}

bce

Value:
['vor unserer Zeitrechnung',
 'vor unserer Zeit',
 'vor der Zeitrechnung',
 'vor der Zeit',
 'v. u. Z.',
 'v. d. Z.',
 'v.u.Z.',
 'v.d.Z.',
...