INDEX
Explanations
phrases related to scientific research and organisms
New Auto-Interp
Negative Logits
aga
-0.59
Ī
-0.57
IJ
-0.56
eries
-0.56
Ĺ
-0.54
wings
-0.54
ritical
-0.53
tan
-0.53
brid
-0.52
tons
-0.51
POSITIVE LOGITS
elsewhere
1.01
anywhere
1.00
alongside
0.97
everywhere
0.92
outdoors
0.91
concurrently
0.89
throughout
0.88
atop
0.86
here
0.85
across
0.84
Activations Density 2.844%