INDEX
Explanations
the phrase "what it is."
phrases that define or inquire about an identity or a state
New Auto-Interp
Negative Logits
zan
-0.69
anches
-0.65
ãĤī
-0.62
Yard
-0.61
Guam
-0.61
²
-0.59
visors
-0.58
oved
-0.58
tted
-0.57
mates
-0.57
POSITIVE LOGITS
supposed
0.84
meant
0.83
doing
0.78
referring
0.76
happening
0.75
experiencing
0.72
nt
0.70
going
0.68
elf
0.68
expecting
0.68
Activations Density 0.074%