INDEX
Explanations
the word "that" followed by characteristic or descriptive elements
the repetition of the word "that" in various contexts
New Auto-Interp
Negative Logits
endants
-0.93
osponsors
-0.93
okers
-0.86
osures
-0.86
pps
-0.86
amps
-0.85
sts
-0.84
asons
-0.84
anches
-0.84
bugs
-0.83
POSITIVE LOGITS
uniqueness
1.06
intangible
1.03
newfound
1.03
openness
1.03
richness
1.02
ambiguity
1.00
totality
1.00
knowledge
0.99
momentum
0.99
mindset
0.98
Activations Density 0.151%