INDEX
Negative Logits
Misc
0.39
untitled
0.38
superficially
0.37
Behind
0.37
слегка
0.35
தோ
0.35
ક્ત
0.34
sedikit
0.34
程度の
0.34
Introducing
0.34
POSITIVE LOGITS
affiliation
0.63
clue
0.59
probleme
0.56
documentation
0.55
warranty
0.55
affiliations
0.54
difficulty
0.54
issues
0.53
clues
0.53
luck
0.52
Activations Density 0.012%