INDEX
Explanations
phrases indicating one example or part of several within a larger context
instances of phrases emphasizing the notion of being just one among many, often in a context of comparison or exemplification
Negative Logits
same         -0.72
iano         -0.65
regn         -0.65
nob          -0.63
raped        -0.62
owers        -0.60
disbanded    -0.60
then         -0.60
sleep        -0.58
acha         -0.58
Positive Logits
examples      0.99
scratching    0.98
iceberg       0.94
sampling      0.90
anecdotal     0.87
sympt         0.86
example       0.81
sample        0.79
symptom       0.79
illustration  0.78
Activations Density 0.115%