INDEX
Explanations
positive adjectives or compliments
occurrences of the verb "to be" in various forms
Negative Logits
Token        Logit
Achieve      -0.72
osate        -0.71
cite         -0.70
icipated     -0.68
irm          -0.68
undertook    -0.67
ilst         -0.67
iates        -0.66
Deter        -0.65
Sources      -0.65
Positive Logits
Token        Logit
gonna        1.29
definitely   1.11
nt           1.08
probably     1.00
kinda        0.95
fucked       0.94
awesome      0.93
amazing      0.91
supposed     0.89
pretty       0.88
Activations Density: 0.749%