INDEX
Explanations
references to political events and figures
Negative Logits
AssemblyProduct  -0.56
rungsseite       -0.56
optarg           -0.54
libft            -0.54
bkz              -0.53
noDo             -0.51
Hochspringen     -0.51
kapturem         -0.50
ويكيميديا        -0.49
femininos        -0.49
Positive Logits
Second           0.41
Second           0.40
بيها             0.38
MENAFN           0.37
Outside          0.37
Outside          0.36
Meanwhile        0.36
сно              0.35
Past             0.35
external         0.35
Activations Density: 0.332%