INDEX
Explanations
references to technology and legislative issues
Negative Logits
Token      Logit
нд         -0.16
ovaly      -0.16
/cop       -0.16
ylland     -0.15
Balance    -0.15
Dispatch   -0.14
onse       -0.14
วล         -0.14
гляд       -0.14
ikip       -0.14
Positive Logits
Token      Logit
war        0.21
War        0.20
l          0.18
war        0.18
_war       0.17
ban        0.17
Mor        0.17
-war       0.16
War        0.16
allen      0.16
Activation Density: 0.005%