INDEX
Explanations
content related to misinformation and false narratives
New Auto-Interp
Negative Logits
+:+
-0.67
parsedMessage
-0.60
defaultstate
-0.56
Hentet
-0.56
CppCodeGen
-0.55
Públicas
-0.52
Bước
-0.52
Jacobi
-0.52
quæ
-0.51
doInBackground
-0.50
POSITIVE LOGITS
Trump
1.48
Trump
1.32
Donald
1.00
Donald
0.98
trump
0.94
trump
0.83
DONALD
0.82
DONALD
0.81
donald
0.79
donald
0.78
Activations Density 0.422%