INDEX
Explanations
words related to farewells and emotional connections
New Auto-Interp
Negative Logits
NAL
-0.15
ارش
-0.14
réuss
-0.13
Enabled
-0.12
ç·Ĵ
-0.12
Realm
-0.12
inded
-0.12
letic
-0.12
others
-0.12
286
-0.12
POSITIVE LOGITS
ol
0.34
ole
0.28
Mr
0.28
Poor
0.26
poor
0.26
Poor
0.24
Mr
0.24
dear
0.23
mr
0.22
èĢģ
0.20
Activations Density 0.337%