Skip to content

Error with section "Erreur*" #24

@lrosique

Description

@lrosique

In 58 pages' words, we have a section named Erreur*

Due to the star in the name, the pyquery operation crashes so this section must be removed.

Here's the error

Traceback (most recent call last):

  File "D:\dev\Python\Python387\lib\site-packages\IPython\core\interactiveshell.py", line 3418, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)

  File "<ipython-input-1-0f4a11aaa388>", line 3, in <module>
    page.get_word_data["partOfSpeech"]

  File "d:\workspaces\WiktionnaireParser\wiktionnaireparser\parser.py", line 77, in get_word_data
    'partOfSpeech': self.get_parts_of_speech(),

  File "d:\workspaces\WiktionnaireParser\wiktionnaireparser\parser.py", line 140, in get_parts_of_speech
    nice_section_name = self._real_section_name(section_name)

  File "d:\workspaces\WiktionnaireParser\wiktionnaireparser\parser.py", line 128, in _real_section_name
    section = self._query.find(section_name)

  File "D:\dev\Python\Python387\lib\site-packages\pyquery\pyquery.py", line 677, in find
    xpath = self._css_to_xpath(selector)

  File "D:\dev\Python\Python387\lib\site-packages\pyquery\pyquery.py", line 282, in _css_to_xpath
    return self._translator.css_to_xpath(selector, prefix)

  File "D:\dev\Python\Python387\lib\site-packages\cssselect\xpath.py", line 192, in css_to_xpath
    for selector in parse(css))

  File "D:\dev\Python\Python387\lib\site-packages\cssselect\parser.py", line 415, in parse
    return list(parse_selector_group(stream))

  File "D:\dev\Python\Python387\lib\site-packages\cssselect\parser.py", line 428, in parse_selector_group
    yield Selector(*parse_selector(stream))

  File "D:\dev\Python\Python387\lib\site-packages\cssselect\parser.py", line 436, in parse_selector
    result, pseudo_element = parse_simple_selector(stream)

  File "D:\dev\Python\Python387\lib\site-packages\cssselect\parser.py", line 544, in parse_simple_selector
    raise SelectorSyntaxError(

  File "<string>", line unknown
SelectorSyntaxError: Expected selector, got <DELIM '*' at 7>

Here are some words :

  • malette
  • pillier
  • léthal
  • trippe

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions