Explanations
references to things being at the center of attention or focus
phrases that indicate a central or main focus in a context
Negative Logits
ishable   -0.76
utan      -0.64
à©        -0.64
é¾įå      -0.61
)--       -0.60
hua       -0.60
syn       -0.60
à¨        -0.60
ratom     -0.59
chops     -0.59
Positive Logits
Initialized   0.78
gravity       0.77
igm           0.70
rency         0.69
inence        0.69
ierre         0.67
EVA           0.67
pole          0.66
gie           0.65
dyl           0.63
Activations Density: 0.067%