INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    OCCURRED
    -0.84
    VIAF
    -0.84
    aarrggbb
    -0.79
     calendriers
    -0.77
    DataAnnotations
    -0.77
    ffindor
    -0.74
    akuza
    -0.74
    antart
    -0.73
    theon
    -0.72
    PreferredItem
    -0.71
    POSITIVE LOGITS
     All
    0.94
    All
    0.86
    <bos>
    0.52
    ↵↵
    0.51
     történ
    0.51
     to
    0.49
    0.49
     a
    0.49
     i
    0.48
    0.48
    Act Density 0.194%

    No Known Activations