INDEX
Explanations
phrases containing the word "exactly" followed by a number
New Auto-Interp
Negative Logits
rift
-0.81
ker
-0.74
ework
-0.73
itiz
-0.68
respectfully
-0.65
kers
-0.64
asta
-0.64
strong
-0.64
tein
-0.63
enthusiastically
-0.62
POSITIVE LOGITS
opposite
0.85
ãĤ¨
0.76
itude
0.69
æ©Ł
0.65
actly
0.63
replicate
0.62
same
0.62
ãĥ¯
0.62
tuned
0.62
identical
0.62
Activations Density 0.358%