INDEX
Explanations
verbs related to exploration, investigation, or revelation
instances of the word "discovered."
New Auto-Interp
Negative Logits
stay
-0.63
orting
-0.60
tone
-0.59
voice
-0.59
depending
-0.59
regulation
-0.58
drive
-0.58
clinton
-0.57
VOL
-0.57
fray
-0.57
POSITIVE LOGITS
discovered
3.29
unearthed
2.13
uncovered
2.07
discovers
1.95
discover
1.93
found
1.89
detected
1.85
noticed
1.79
discovering
1.73
discoveries
1.67
Activations Density 0.013%