Explanations
references to oaths and vows, particularly in a context involving love and commitment
Negative Logits
å¡       -0.18
rana     -0.14
quier    -0.14
itzer    -0.14
ernel    -0.14
Perr     -0.14
witter   -0.13
Ñĥз      -0.13
ojis     -0.13
oes      -0.13
Positive Logits
oath         0.27
allegiance   0.25
sworn        0.23
swearing     0.19
swore        0.19
vows         0.18
swear        0.18
binds        0.18
pled         0.17
pledge       0.17
Activations Density 0.032%