INDEX
Explanations
actions and verbs related to tasks or commands
New Auto-Interp
Negative Logits
houſe
-1.05
ſtate
-1.03
myſelf
-1.02
purpoſe
-0.97
pleaſure
-0.95
poffe
-0.92
itſelf
-0.92
Jefus
-0.91
fubject
-0.91
perſon
-0.91
POSITIVE LOGITS
矶
0.75
be
0.69
0.67
icoot
0.65
ETHING
0.64
“
0.63
"]=
0.62
"]="
0.60
stel
0.58
ENOS
0.57
Activations Density 0.034%