INDEX
Explanations
terms related to bold actions or statements
the repeated use of the word "bold" describing actions or policies
New Auto-Interp
Negative Logits
Cheong
-0.88
ADS
-0.79
OTOS
-0.77
VA
-0.70
UTERS
-0.69
COURT
-0.67
AW
-0.67
OTA
-0.66
Gun
-0.64
UT
-0.64
POSITIVE LOGITS
bold
1.26
bold
1.25
Ital
1.02
faced
0.97
ital
0.93
face
0.91
daring
0.84
itude
0.82
Bold
0.79
éĹĺ
0.79
Activations Density 0.006%