INDEX
Explanations
phrases that indicate a strong commitment or belief
New Auto-Interp
Negative Logits
similar
-0.62
some
-0.61
similar
-0.58
ſmall
-0.57
unusual
-0.55
необы
-0.55
some
-0.54
Podob
-0.54
giovan
-0.54
unusually
-0.53
POSITIVE LOGITS
]),
0.78
utafitiHapana
0.77
heartedly
0.74
firmly
0.73
]}"
0.69
loem
0.69
squarely
0.68
"),
0.68
heartily
0.67
légales
0.65
Activations Density 0.376%