Explanations
references to struggles or failures in various contexts
Negative Logits
Token        Logit
reesome      -0.17
hurst        -0.15
ÙĩÙĩ         -0.15
upos         -0.15
меÑĩ         -0.15
invalid      -0.14
outnumber    -0.14
teki         -0.14
roken        -0.14
iyah         -0.14
Positive Logits
Token        Logit
struggle     0.38
struggles    0.34
lim          0.30
lag          0.29
fal          0.28
langu        0.27
struggled    0.26
fl           0.26
wall         0.26
strugg       0.25
Activations Density 0.281%