INDEX
Explanations
expressions or phrases indicating change and complexity in situations
New Auto-Interp
Negative Logits
fundos
-0.51
ukseen
-0.47
sentenza
-0.45
forums
-0.45
tepat
-0.45
лені
-0.45
bashrc
-0.45
التالي
-0.44
nepiecieš
-0.44
camadas
-0.43
POSITIVE LOGITS
things
1.18
things
1.15
Things
1.13
Things
1.10
THINGS
1.06
thing
0.99
клопе
0.98
THING
0.90
THING
0.87
thing
0.81
Activations Density 0.223%