Explanations
subjective language
moderate adjectives
Negative Logits
!*\          -0.65
runApp       -0.57
__':         -0.56
urlpatterns  -0.55
Référence    -0.54
MarshalTo    -0.53
препратки    -0.53
gdx          -0.52
للمعارف      -0.52
جغرافيا      -0.52
Positive Logits
bit          1.60
slightly     1.59
somewhat     1.50
slightly     1.41
Slightly     1.41
biraz        1.30
Slightly     1.30
lidt         1.30
somewhat     1.29
nieco        1.27
Activations Density: 1.402%