INDEX
Explanations
expressions of personal feelings and assessments of value or importance
New Auto-Interp
Negative Logits
ãĥ¼ãĥ
-0.18
oleÄį
-0.15
اÙĦسÙħ
-0.15
gregation
-0.15
apur
-0.14
apg
-0.14
ankind
-0.14
519
-0.14
angent
-0.14
anges
-0.14
POSITIVE LOGITS
orie
0.16
باز
0.14
qn
0.14
adal
0.14
dup
0.14
orian
0.14
ÏĢÎŃ
0.14
Textbox
0.13
ear
0.13
Ĺi
0.13
Activations Density 0.140%