INDEX
Explanations
positive descriptors and affirmations related to people's actions or characteristics
New Auto-Interp
Negative Logits
ãģ¾ãģļ
-0.18
sice
-0.18
totiž
-0.18
however
-0.17
/MPL
-0.15
ابتدا
-0.15
åıĬåħ¶
-0.15
firstly
-0.15
Firstly
-0.14
walker
-0.14
POSITIVE LOGITS
therefore
0.29
thus
0.24
thus
0.23
hence
0.23
thereby
0.22
rogen
0.21
Therefore
0.20
consequently
0.19
поÑįÑĤомÑĥ
0.19
Therefore
0.19
Activations Density 0.273%