Explanations
phrases that describe daily activities or experiences
Negative Logits
/INFO      -0.15
UNCH       -0.15
ugin       -0.14
grandma    -0.14
boa        -0.14
inely      -0.13
pData      -0.13
UID        -0.13
UGIN       -0.13
izik       -0.13

Positive Logits
Hub         0.45
hubs        0.42
hub         0.42
hub         0.40
Hub         0.39
Husband     0.38
dh          0.37
DH          0.36
Hubb        0.35
DH          0.34

Activations Density: 0.277%