INDEX
Explanations
phrases related to approval or authorization
phrases or terms related to official approvals or permissions
New Auto-Interp
Negative Logits
obs
-0.71
zanne
-0.66
ĸļ
-0.65
zilla
-0.62
onomic
-0.61
Kahn
-0.61
jad
-0.61
ilant
-0.60
ISM
-0.59
Gall
-0.59
POSITIVE LOGITS
shoot
0.78
ments
0.71
undown
0.70
sites
0.68
abase
0.67
site
0.65
MENTS
0.65
ardon
0.63
loading
0.62
spr
0.62
Activations Density 0.041%