INDEX
Explanations
specific references to a particular concept or entity
instances of the word "this" in relation to various contexts and topics
New Auto-Interp
Negative Logits
icons
-0.88
lee
-0.79
å§«
-0.77
okers
-0.76
acers
-0.76
mates
-0.75
visors
-0.75
ashes
-0.74
masters
-0.74
master
-0.73
POSITIVE LOGITS
particular
1.07
newfound
1.03
phenomenon
1.02
trope
1.02
century
0.93
avenue
0.93
kind
0.93
hemisphere
0.92
millenn
0.91
enigmatic
0.91
Activations Density 0.226%