INDEX
    Explanations

    mentions of familiarity or lack thereof with certain topics or concepts

    references to familiarity and unfamiliarity with concepts or subjects

    New Auto-Interp
    Negative Logits
    Ħ¢
    -0.69
    tein
    -0.65
    avorite
    -0.63
     practicable
    -0.63
    ĺħ
    -0.58
    ument
    -0.57
    ²¾
    -0.57
    ©¶æ
    -0.56
    efficiency
    -0.56
    secondary
    -0.56
    POSITIVE LOGITS
    izing
    1.21
    ized
    1.19
    ize
    1.09
    ising
    1.06
    ity
    1.05
    ities
    1.02
    ised
    1.02
    enough
    1.00
    izes
    0.97
    lly
    0.95
    Act Density 0.032%

    No Known Activations