INDEX
Explanations
quotations or dialogue indicators
Negative Logits
luv          -0.16
wat          -0.16
isoft        -0.15
oola         -0.15
lob          -0.14
puted        -0.14
.toLocale    -0.14
ols          -0.14
ulers        -0.14
Verdana      -0.14
Positive Logits
_PF          0.16
odge         0.16
âĹĦ          0.15
retention    0.15
endance      0.15
ret          0.15
вÑĸд          0.15
butterfly    0.14
rock         0.14
adi          0.14
Activations Density: 0.029%