INDEX
Explanations
names of individuals and specific places or entities
entities related to prominent public figures and organizations, particularly in relation to politics and society
New Auto-Interp
Negative Logits
omit
-0.55
fters
-0.54
©¶æ
-0.54
Topics
-0.53
disparate
-0.50
fter
-0.50
rocal
-0.50
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.49
alyses
-0.49
contemporaries
-0.49
POSITIVE LOGITS
*.
0.89
.[
0.85
.</
0.84
.
0.84
_.
0.82
.ãĢį
0.79
itself
0.78
.–
0.78
!.
0.76
!!!!!!!!
0.75
Activations Density 1.011%