Morphology support #1

@vgel

Right now terminal tokens have to be separate words. Treebender should be able to support morphological rules:

V[ stem: t ] -> walk
V[ stem: t ] -> talk
// stem: f to block walkedededededededed...
V[ tense: past, stem: f ] -> V[ stem: t ] ++ ed  // syntax TBD
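
To illustrate why the `stem` feature blocks the runaway recursion, here is a minimal Python sketch. It is not Treebender's API: `Entry`, `make_entry`, and `apply_suffix_rule` are hypothetical names made up for this example, and features are flat key/value pairs rather than real feature structures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Hypothetical lexical entry: a surface form plus flat features."""
    form: str
    features: tuple  # sorted (key, value) pairs, so entries stay hashable

    def get(self, key):
        return dict(self.features).get(key)


def make_entry(form, **features):
    return Entry(form, tuple(sorted(features.items())))


def apply_suffix_rule(entries, suffix, requires, produces):
    """Apply `X[produces] -> X[requires] ++ suffix` once to each matching entry."""
    out = []
    for entry in entries:
        if all(entry.get(k) == v for k, v in requires.items()):
            out.append(make_entry(entry.form + suffix, **produces))
    return out


lexicon = [make_entry("walk", stem="t"), make_entry("talk", stem="t")]
rule = dict(suffix="ed", requires={"stem": "t"},
            produces={"tense": "past", "stem": "f"})

past = apply_suffix_rule(lexicon, **rule)
# Feeding the output back in matches nothing, because stem is now f.
again = apply_suffix_rule(past, **rule)
```

The second application yields an empty list, which is the behaviour the `stem: f` annotation is meant to guarantee.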

Questions:

  • What scope do we want here? Are we only supporting basic concatenative morphology (prefixes and suffixes), or will we try to support allomorphy, sound changes / ablaut, Semitic roots, ...?
    • It's tempting to just focus on English, support only concatenative morphology, and let the user fall back to explicit rules for irregular forms with a flag:
        V[ can_inflect: y ] -> walk
        V[ can_inflect: n ] -> buy
        V[ tense: past, can_inflect: n ] -> V[ can_inflect: y ] ++ ed
        V[ tense: past, can_inflect: n ] -> bought
    • However, lots of common words in English have spelling changes like bake ~ baked, not *bakeed. There's no real way to support that without some more sophisticated tool or tons of duplicate rules.
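
One possible shape for the "more sophisticated tool" is a spelling-adjustment step that runs at the morpheme boundary instead of plain concatenation. The sketch below is only an assumption about how that could look: `attach_suffix` is a hypothetical helper, and it covers only the single silent-e case from the bake ~ baked example.

```python
VOWELS = "aeiou"


def attach_suffix(stem, suffix):
    """Join stem and suffix, dropping a stem-final 'e' before a
    vowel-initial suffix (bake + ed -> baked, not *bakeed)."""
    if stem.endswith("e") and suffix and suffix[0] in VOWELS:
        return stem[:-1] + suffix
    # Otherwise fall back to plain concatenation.
    return stem + suffix
```

Even this one rule is incomplete: it would turn see + ing into *seing, and it does nothing for consonant doubling (stop ~ stopped), so a real solution would need an ordered set of such rules or an exception list.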
    
    

Todo:

  • Remind myself of how the LKB does this


Labels: enhancement
