INDEX
Explanations
words related to laws, regulations, and official terms
the usage of significant pronouns and determiners in context
New Auto-Interp
Negative Logits
republic
-0.64
vigilance
-0.64
istration
-0.63
scanner
-0.62
reservations
-0.61
eatured
-0.61
democr
-0.60
Braz
-0.59
tourism
-0.59
reel
-0.59
POSITIVE LOGITS
Own
0.83
Ones
0.80
ulhu
0.79
TPPStreamerBot
0.76
Started
0.74
Care
0.73
Maker
0.71
Himself
0.71
Points
0.71
wagen
0.70
Activations Density 0.591%