INDEX
Explanations
expressions or phrases where something is described or paraphrased
the word "it" used in various contexts
New Auto-Interp
Negative Logits
hift
-0.76
ppa
-0.71
IELD
-0.69
adan
-0.68
Panama
-0.63
Flavoring
-0.63
icial
-0.61
424
-0.60
ept
-0.59
Java
-0.59
POSITIVE LOGITS
self
0.97
bluntly
0.80
selves
0.77
chy
0.73
unes
0.72
succinct
0.70
mildly
0.69
anooga
0.69
sarcast
0.68
selves
0.68
Activations Density 0.051%