INDEX
Explanations
pronouns referring to a group of people
New Auto-Interp
Negative Logits
profitability
-0.73
Affordable
-0.65
Fortress
-0.64
Ribbon
-0.64
Estate
-0.63
Cu
-0.63
Cance
-0.62
AMERICA
-0.62
mascot
-0.61
Mansion
-0.59
POSITIVE LOGITS
perceive
1.16
intu
1.16
understand
1.15
subconscious
1.14
infer
1.10
understanding
1.09
instinctively
1.09
know
1.08
wonder
1.07
wondering
1.07
Activations Density 0.505%