INDEX
Explanations
instances of the word "pretty" or expressions of relative quality
New Auto-Interp
Negative Logits
ettings
-0.19
ooth
-0.15
ourcem
-0.15
hints
-0.15
esco
-0.15
sic
-0.15
acad
-0.15
omer
-0.15
andest
-0.15
Ñģен
-0.14
POSITIVE LOGITS
-ÑĤаки
0.22
»
0.15
못
0.15
../../../
0.15
lick
0.14
leared
0.14
Ù
0.14
ColumnName
0.14
alker
0.14
ROC
0.14
Activations Density 0.046%