INDEX
    Explanations

    simulation, politics, foreign word parts

    New Auto-Interp
    Negative Logits
     Borges
    0.45
    Vis
    0.42
    Skirt
    0.41
     Edelstahl
    0.39
    Fil
    0.39
     Drilling
    0.39
     Zoro
    0.38
     Fried
    0.38
    azas
    0.37
     विपरीत
    0.37
    POSITIVE LOGITS
    वता
    0.42
    0.39
    ാപാ
    0.39
     szen
    0.37
    urn
    0.36
     democracy
    0.36
    megen
    0.36
    democracy
    0.35
     politics
    0.34
    ტი
    0.34
    Act Density 0.002%

    No Known Activations