Explanations
specific phrases related to the application or relevance of information
phrases indicating applicability or relevance to various contexts
Negative Logits
Token       Logit
ikk         -0.74
GOP         -0.72
erg         -0.69
inus        -0.66
manac       -0.65
ocative     -0.64
ipedia      -0.64
trap        -0.64
inder       -0.64
ãĥīãĥ©      -0.63
Positive Logits
Token       Logit
EVERY       1.20
every       1.18
both        1.18
virtually   1.14
everyone    1.12
everything  1.11
all         1.11
everybody   1.10
only        1.08
ALL         1.06
Activations Density 0.533%
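
The dashboard does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero: the function name `activation_density`, the `threshold` parameter, and the toy activation list are all hypothetical and chosen only so the arithmetic lands on the same order of magnitude; they are not the data behind this feature.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The dashboard's exact definition is not given.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data (not from the dashboard): 4 active positions out of 750.
toy_activations = [0.0] * 746 + [1.3, 0.2, 2.1, 0.7]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density near half a percent means the feature is silent on the vast majority of tokens, which would fit a feature keyed to a narrow set of quantifier-like words.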