INDEX
Explanations
mentions of unofficial or unauthorized activities
the term "unofficial" in various contexts
New Auto-Interp
Negative Logits
ĸļ
-0.88
hma
-0.83
oran
-0.76
ebook
-0.75
erm
-0.74
gans
-0.74
nesota
-0.73
ggle
-0.71
xual
-0.70
ulhu
-0.70
POSITIVE LOGITS
unofficial
1.19
official
0.87
referen
0.79
official
0.75
truce
0.73
representative
0.73
informal
0.73
canonical
0.72
accompan
0.69
nonpartisan
0.68
Activations Density 0.004%