INDEX
Explanations
clear and significant descriptors related to qualities and differences
New Auto-Interp
Negative Logits
pinulongan
-0.65
UserScript
-0.52
ValueStyle
-0.43
IFTT
-0.43
Agustus
-0.43
czasem
-0.43
cafetería
-0.43
kaarangay
-0.43
はじめに
-0.42
disambiguazione
-0.42
POSITIVE LOGITS
strongly
0.65
heavily
0.60
sekali
0.55
deeply
0.54
Strongly
0.54
greatly
0.53
strict
0.53
strongly
0.52
Strongly
0.52
מאוד
0.52
Activations Density 0.761%