INDEX
Explanations
positive affirmations or recommendations about objects or experiences
New Auto-Interp
Negative Logits
None
-0.35
ninguna
-0.35
None
-0.34
ninguno
-0.32
själva
-0.31
cả
-0.31
Вікіпе
-0.30
ningún
-0.29
none
-0.28
fflush
-0.27
POSITIVE LOGITS
practically
2.84
almost
2.81
virtually
2.70
nearly
2.64
prácticamente
2.52
Almost
2.50
Almost
2.47
praticamente
2.45
quase
2.44
almost
2.42
Activations Density 1.011%