INDEX
Explanations
words related to reputation or representation
words or terms related to representation and reporters
New Auto-Interp
Negative Logits
istically
-0.73
scape
-0.66
gha
-0.62
gaard
-0.61
Archdemon
-0.60
EntityItem
-0.59
Brist
-0.59
wings
-0.58
Conditions
-0.57
=-=-=-=-
-0.57
POSITIVE LOGITS
ublic
1.52
resents
1.18
atri
1.04
orters
0.99
rint
0.96
ulse
0.95
ession
0.94
itude
0.87
rising
0.87
aint
0.86
Activations Density 0.019%