INDEX
Explanations
phrases related to subjective opinion or interpretation
New Auto-Interp
Negative Logits
incinn
-0.73
oute
-0.70
iland
-0.70
milo
-0.69
inate
-0.69
okane
-0.67
elta
-0.66
eni
-0.64
uala
-0.63
inav
-0.62
POSITIVE LOGITS
aforementioned
0.69
usual
0.67
attendant
0.67
evidenced
0.65
envis
0.65
liest
0.63
same
0.58
posed
0.58
equation
0.56
ħĭ
0.56
Activations Density 0.010%