INDEX
Explanations
negations and quantities within the text
Negative Logits
418            -0.15
Fior           -0.15
anje           -0.15
ARRIER         -0.14
ARD            -0.14
ActionTypes    -0.14
OUNDS          -0.14
krv            -0.14
vendor         -0.14
Vendor         -0.14
POSITIVE LOGITS
aln            0.18
áln            0.17
agna           0.16
机关           0.16
alte           0.14
igu            0.14
ayıp           0.14
egal           0.14
arch           0.14
Roths          0.14
Activations Density 0.003%