INDEX
    Explanations

    references to models and their specifications

    archaic or non-english words

    the token "model" (occurrences of the word "model", often as a speaker/label token).

    New Auto-Interp
    Negative Logits
     Италијани
    -0.44
     written
    -0.41
     pyplot
    -0.37
    PutMapping
    -0.37
    AutoField
    -0.37
    statechange
    -0.37
     Chwiliwch
    -0.37
    CardModule
    -0.36
     surla
    -0.36
    طلحات
    -0.36
    POSITIVE LOGITS
    principalColumn
    0.57
     humains
    0.53
     répon
    0.52
     purpoſe
    0.52
     utafitiHapana
    0.50
    NUMX
    0.50
    GOTREF
    0.50
     civilización
    0.50
     avoient
    0.49
     fhew
    0.49
    Act Density 0.001%

    No Known Activations