INDEX
Explanations
phrases that discuss governance and political systems
Head Attr Weights
0:0.10
1:0.07
2:0.08
3:0.07
4:0.07
5:0.07
6:0.07
7:0.07
8:0.09
9:0.09
10:0.09
11:0.08
Negative Logits
guiActive   -1.92
millenn     -1.83
okemon      -1.79
Stim        -1.68
Rosenthal   -1.66
ITCH        -1.64
Rider       -1.63
Greater     -1.63
omet        -1.61
★★          -1.60
Positive Logits
cabin       1.81
uments      1.72
sama        1.71
Untitled    1.67
osures      1.65
favors      1.60
mates       1.60
drafts      1.59
estation    1.56
avy         1.54
Activations Density 0.000%