INDEX
Explanations
phrases related to political and social issues
concepts related to social issues and political discourse
New Auto-Interp
Negative Logits
appre
-0.55
TheNitromeFan
-0.50
CLS
-0.50
ajor
-0.49
Princ
-0.48
Published
-0.48
Wilkinson
-0.48
Dover
-0.47
IMAGES
-0.47
Berks
-0.45
POSITIVE LOGITS
thereto
0.81
thereof
0.79
).[
0.79
thereafter
0.74
)).
0.72
ãĥĺ
0.69
?).
0.67
therein
0.65
)?
0.63
accordingly
0.59
Activations Density 3.080%