INDEX
Explanations
phrases related to situations or actions happening in front of someone
New Auto-Interp
Negative Logits
zn
-0.68
avorite
-0.68
cci
-0.67
chini
-0.67
redited
-0.66
Strait
-0.64
zie
-0.63
Preferred
-0.60
ajor
-0.60
risome
-0.59
POSITIVE LOGITS
aday
0.89
of
0.81
matter
0.75
thereof
0.74
runners
0.74
isp
0.74
woods
0.71
iers
0.71
of
0.70
screen
0.69
Activations Density 0.013%