INDEX
    Explanations

    statements or quotations in text

    dialogue or statements made by individuals

    Negative Logits
    Token          Logit
    " Vaugh"       -0.70
    "ãĥ¼ãĥĨ"       -0.63
    "Redd"         -0.62
    "amac"         -0.60
    " condem"      -0.60
    "egu"          -0.60
    " disadvant"   -0.58
    "Mobil"        -0.57
    " advoc"       -0.57
    " lapt"        -0.57
    Positive Logits
    Token          Logit
    " âĢº"         0.55
    " guiActive"   0.49
    " â"           0.49
    " âľ"          0.48
    " crochet"     0.46
    " ye"          0.46
    " ages"        0.45
    "vernment"     0.45
    " ·"           0.45
    " TRI"         0.45
    Activation Density: 0.452%

    No Known Activations