INDEX
Explanations
phrases related to something being the basis or inspiration for something else
the term "based" in relation to various contexts or narratives
New Auto-Interp
Negative Logits
antha
-0.75
dra
-0.73
ESE
-0.71
apes
-0.70
女
-0.66
shr
-0.66
ãģı
-0.65
ests
-0.63
osen
-0.63
ese
-0.62
POSITIVE LOGITS
loosely
0.89
upon
0.79
solely
0.79
awaru
0.74
ragon
0.68
withd
0.68
illac
0.68
certific
0.68
amera
0.67
lly
0.66
Activations Density 0.022%