INDEX
Explanations
words and phrases that convey a sense of struggle or challenge
New Auto-Interp
Negative Logits
Verfügung
-0.14
/Instruction
-0.14
)↵↵↵↵↵↵↵↵
-0.14
allo
-0.14
uard
-0.14
417
-0.13
416
-0.13
iversit
-0.13
.updateDynamic
-0.13
/do
-0.13
POSITIVE LOGITS
uality
0.19
-looking
0.19
YNAM
0.16
nÃło
0.16
Ùį
0.16
/null
0.16
ly
0.16
iards
0.15
ned
0.15
جدا
0.15
Activations Density 0.131%