INDEX
Explanations
instances of the word "finds" followed by any word or phrase
the verb "find" and its various contexts in sentences
New Auto-Interp
Negative Logits
creen
-0.63
mandatory
-0.61
roud
-0.59
Haste
-0.57
<[
-0.57
ollo
-0.56
buff
-0.55
medium
-0.54
BS
-0.53
strict
-0.53
POSITIVE LOGITS
finds
3.32
discovers
2.09
find
1.76
find
1.75
Find
1.59
Find
1.58
sees
1.55
found
1.50
learns
1.48
finding
1.47
Activations Density 0.014%