INDEX
Explanations
text related to regulations or legal matters
collective actions or sentiments within a community
New Auto-Interp
Negative Logits
himself
-0.66
herself
-0.66
Digest
-0.56
Adolf
-0.56
elson
-0.55
stroke
-0.54
itto
-0.51
ented
-0.50
rubbed
-0.49
Reilly
-0.49
POSITIVE LOGITS
ourselves
1.58
our
1.00
OUR
0.86
ours
0.84
Our
0.76
asses
0.74
Our
0.72
selves
0.68
collectively
0.60
blogs
0.60
Activations Density 0.957%