INDEX
Explanations
phrases related to expressing agreement or positive sentiment towards something
variations of the word "like" in different contexts
Negative Logits
tein      -0.75
exting    -0.71
Catal     -0.70
destro    -0.69
enthusi   -0.68
inas      -0.68
Ire       -0.67
ilts      -0.66
Parsons   -0.65
éŃĶ       -0.65
POSITIVE LOGITS
lihood    2.03
liest     1.17
minded    1.09
minded    1.08
lier      1.03
liness    1.02
ability   1.01
able      0.89
joy       0.78
ably      0.73
Activations Density: 0.048%