INDEX
    Explanations

    phrases or words related to showing preference or support for something

    expressions of support or preference for something

    New Auto-Interp
    Negative Logits
    colored
    -1.31
    --
    -1.23
    defense
    -1.21
    avored
    -1.19
    bors
    -1.08
     canceled
    -1.07
     traveled
    -1.06
     honor
    -1.03
     honoring
    -1.02
     labeled
    -1.02
    POSITIVE LOGITS
    colour
    2.21
     recognise
    2.20
     realise
    2.20
     colour
    2.15
     colours
    2.15
     realised
    2.13
     organise
    2.11
     flavour
    2.09
     humour
    2.08
     practise
    2.08
    Act Density 0.091%

    No Known Activations