INDEX
Explanations
phrases related to personal experiences and opinions
expressions of personal feelings and reflections, particularly concerning friendships and serious situations
New Auto-Interp
Negative Logits
Kinnikuman
-0.65
asonic
-0.63
Lovecraft
-0.62
Whedon
-0.62
untled
-0.61
Buch
-0.61
ģĸ
-0.61
STDOUT
-0.60
¶ħ
-0.59
ories
-0.59
POSITIVE LOGITS
.,"
1.00
,''
0.99
sic
0.94
,'"
0.89
[-
0.88
),"
0.87
['
0.86
`
0.83
',"
0.81
,'
0.80
Activations Density 0.929%