INDEX
Explanations
words that indicate measurement or comparison
New Auto-Interp
Negative Logits
ôn
-0.16
iot
-0.16
PG
-0.15
odst
-0.15
çĵľ
-0.15
Reich
-0.14
CROSS
-0.14
uir
-0.14
Stick
-0.14
Isa
-0.14
POSITIVE LOGITS
Sum
0.17
aghan
0.17
rane
0.16
tie
0.16
462
0.15
-sum
0.15
Deniz
0.15
Cassidy
0.15
_SUM
0.14
summary
0.14
Activations Density 0.010%