INDEX
Explanations
references to positions of authority or notable individuals
New Auto-Interp
Negative Logits
eref
-0.18
rawer
-0.16
иÑĤи
-0.15
íĭ±
-0.15
ominator
-0.15
romatic
-0.15
apter
-0.15
ÃŃž
-0.14
inciple
-0.14
éģķ
-0.14
POSITIVE LOGITS
akin
0.16
PLIT
0.15
/current
0.15
Energ
0.15
ulse
0.14
inde
0.14
PLY
0.14
panion
0.14
Äĥng
0.14
AYOUT
0.14
Activations Density 0.177%