INDEX
Explanations
phrases indicating attempts or efforts to perform actions
New Auto-Interp
Negative Logits
Futura
-0.68
Genesee
-0.60
constaté
-0.56
höchst
-0.52
ontal
-0.51
glow
-0.51
feasting
-0.50
พบ
-0.50
myſelf
-0.49
constater
-0.49
POSITIVE LOGITS
attempt
1.45
attempts
1.33
versucht
1.27
Attempt
1.25
Trying
1.24
versuchen
1.23
Attempts
1.20
tentando
1.20
trying
1.20
Trying
1.19
Activations Density 0.153%