INDEX
    Explanations

    phrases related to research studies and scientific reports

    Appears after prepositions

    foreign or technical language

    New Auto-Interp
    Negative Logits
     tegens
    -0.35
     kautta
    -0.35
    forhold
    -0.34
    bės
    -0.34
     forhold
    -0.34
     mogelijkheden
    -0.33
    很多
    -0.32
    harapkan
    -0.32
     ways
    -0.32
     are
    -0.32
    POSITIVE LOGITS
    󠁣
    0.95
    ScopeManager
    0.91
     autorytatywna
    0.87
    parsedMessage
    0.86
    ſicht
    0.83
     deſſen
    0.82
     Wikimedijinoj
    0.81
     Мексичка
    0.80
    MLLoader
    0.79
    [@BOS@]
    0.78
    Act Density 0.637%

    No Known Activations