INDEX
Explanations
phrases related to granting permission or enabling actions
phrases indicating the ability or capacity to enable actions or functions
New Auto-Interp
Negative Logits
worn
-0.66
ko
-0.64
-0.63
boy
-0.63
ta
-0.62
xon
-0.60
ohm
-0.59
kind
-0.58
wa
-0.57
Horton
-0.57
POSITIVE LOGITS
geries
0.94
us
0.89
Reviewer
0.88
hift
0.84
users
0.83
ories
0.80
seamless
0.80
iences
0.78
awaru
0.78
icial
0.77
Activations Density 0.092%