Explanations
instances of the word "difficult" or related terms indicating challenges or struggles
Negative Logits
orro      -0.15
quivo     -0.15
inox      -0.14
lify      -0.14
нг        -0.14
Concern   -0.14
orrow     -0.14
اگ        -0.14
folders   -0.13
elik      -0.13
Positive Logits
khăn      0.28
-to       0.28
ly        0.26
ies       0.21
icult     0.21
terrain   0.20
task      0.19
iating    0.18
going     0.18
-hard     0.18
Activations Density 0.026%
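
The density figure above is usually read as the fraction of token positions on which the feature fires at all. The page itself does not define it, so the sketch below rests on that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, feature active on 3 of them.
sample = [0.0] * 10_000
for i in (12, 4_500, 9_999):
    sample[i] = 1.7

density = activation_density(sample)
print(f"{density:.3%}")  # 0.030%
```

Under this reading, a density of 0.026% would mean the feature is active on roughly 26 of every 100,000 tokens, which fits a narrow feature tied to words such as "difficult".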