INDEX
Explanations
political and geographic terms related to current events or disputes
phrases related to significant political events or statements
New Auto-Interp
Negative Logits
concess
-0.66
mathemat
-0.57
reluct
-0.53
notor
-0.51
multip
-0.50
prototyp
-0.49
benign
-0.49
conditional
-0.48
osite
-0.48
conscientious
-0.48
POSITIVE LOGITS
↵Âł
1.23
↵↵
1.12
[/
1.07
↵
1.07
Posted
1.04
"}
1.03
)?
0.98
.#
0.97
.?
0.97
"},"
0.97
Activations Density 2.561%