Explanations
names of people and concepts related to politics, economics, or academia
names of well-known figures and entities, particularly in the context of discussions or references
Negative Logits
skelet          -0.74
UNCLASSIFIED    -0.72
disadvant       -0.59
etheless        -0.58
exting          -0.57
pse             -0.57
âķIJ             -0.56
Azerb           -0.54
âĹ¼             -0.54
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ    -0.54
Positive Logits
[…]             0.70
âĢİ             0.65
âĢİ             0.61
âĢº             0.58
Donald          0.54
↵Âł             0.53
....            0.51
Posted          0.51
...             0.48
20439           0.48
Activations Density 6.016%
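
Several tokens in the logit lists (for example âĶĢ and âĢİ) look like byte-level BPE vocabulary strings rather than readable text. The sketch below shows one way to turn such strings back into the characters they stand for. It assumes the dashboard uses the GPT-2 style byte-to-character table, which the page itself does not state; the function names bytes_to_unicode and decode_token are illustrative, not part of any tool referenced here.

```python
def bytes_to_unicode():
    """Build the byte -> printable character table used by GPT-2 style
    byte-level BPE (assumed, not confirmed, to apply to this page)."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    table = {b: chr(b) for b in printable}
    shift = 0
    for b in range(256):
        if b not in table:
            # Non-printable bytes are remapped to code points from 256 upward.
            table[b] = chr(256 + shift)
            shift += 1
    return table


# Inverse table: displayed character -> original byte value.
_CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Return the text a byte-level token string represents.

    Strings containing characters outside the table (such as a
    display-only newline arrow) are returned unchanged.
    """
    if any(ch not in _CHAR_TO_BYTE for ch in token):
        return token
    raw = bytes(_CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")
```

Under that assumption, decode_token("âĶĢ") yields the box-drawing character U+2500 and decode_token("âĢİ") yields the invisible left-to-right mark U+200E, while plain tokens such as "Donald" pass through unchanged.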