INDEX
Explanations
mentions of specific cultural or historical references related to preservation and recognition
Follows punctuation and question words
making, taking, putting
New Auto-Interp
Negative Logits
مشين
-0.58
guenos
-0.57
ویکیپدیای
-0.57
된
-0.56
副本
-0.56
trường
-0.56
صوتيه
-0.54
łóż
-0.54
Fazit
-0.53
#
-0.53
POSITIVE LOGITS
Making
1.43
Making
1.35
Taking
1.35
Taking
1.30
Putting
1.29
Getting
1.26
Giving
1.25
Bringing
1.24
Doing
1.24
Getting
1.22
Activations Density 0.311%