Explanations
relationships involving the passage of time or long durations
Negative Logits
igner      -0.18
OK         -0.17
"Yeah      -0.16
opup       -0.16
opc        -0.16
somebody   -0.16
guys       -0.15
assi       -0.15
anybody    -0.15
everybody  -0.15
Positive Logits
Papa        0.18
coin        0.16
Parliament  0.15
Wolver      0.15
tup         0.15
Wedding     0.15
physic      0.15
Tüm         0.15
oÄį         0.15
Boxing      0.15
Activations Density: 0.014%