INDEX
Explanations
instances of the verb "was" followed by adjectives or prepositional phrases
repeated mentions of the word “was.”
New Auto-Interp
Negative Logits
Individuals
-0.67
Current
-0.66
preserves
-0.63
Which
-0.62
Highlands
-0.62
entails
-0.61
Various
-0.61
Countries
-0.60
Replacement
-0.60
bies
-0.60
POSITIVE LOGITS
wolves
1.01
fortunate
0.99
hoping
0.97
wondering
0.97
able
0.96
expecting
0.96
tempted
0.92
unable
0.92
lucky
0.91
afraid
0.91
Activations Density 0.201%