INDEX
Explanations
personal pronouns followed by verbs that indicate knowledge or action taken by someone
references to personal connections and relationships
New Auto-Interp
Negative Logits
Spur
-0.69
vine
-0.67
TBD
-0.66
Barn
-0.60
Boolean
-0.59
Euph
-0.59
sorcery
-0.58
Clever
-0.58
Interstitial
-0.57
Appendix
-0.57
POSITIVE LOGITS
arers
0.80
athered
0.76
adore
0.74
frequ
0.74
istries
0.73
encountered
0.72
victimized
0.72
despise
0.71
uras
0.71
worshipped
0.71
Activations Density 0.197%