    Negative Logits
    "O"                     0.52
    " davvero"              0.50
    " massimo"              0.50
    " overhear"             0.47
    "iglia"                 0.45
    " quanta"               0.44
    " protagonist"          0.43
    "ogliere"               0.43
    " ovviamente"           0.43
    " lasci"                0.43
    Positive Logits
    "'})."                  0.50
    (token not rendered)    0.46
    " })."                  0.45
    "тира"                  0.45
    (token not rendered)    0.44
    " aboriginal"           0.42
    "冷却"                  0.42
    "ecological"            0.42
    (token not rendered)    0.41
    (token not rendered)    0.41
    Act Density 0.001%

    No Known Activations