Explanations
expressions of personal experience and feelings
Negative Logits
kasarigan      -1.07
noDo           -0.82
Personensuche  -0.82
betweenstory   -0.82
InitVars       -0.80
دانشنامهٔ      -0.79
EDEFAULT       -0.78
kaarangay      -0.76
Efq            -0.73
-0.73
Positive Logits
'     1.34
’     1.32
ve    0.89
ve    0.79
`     0.75
â     0.69
have  0.68
&#    0.64
\'    0.61
v     0.60
Activations Density 0.211%