INDEX
Explanations
instances of punctuation and sentence structure
New Auto-Interp
Negative Logits
rag
-0.17
myfile
-0.16
rio
-0.15
ride
-0.15
allo
-0.15
perc
-0.15
Mile
-0.14
ÙĦات
-0.14
perc
-0.14
ÑĢой
-0.14
POSITIVE LOGITS
)test
0.16
intent
0.14
intent
0.14
ÌĨ
0.14
unlike
0.14
_Free
0.14
Intent
0.14
intents
0.14
erais
0.14
éru
0.13
Activations Density 1.063%