Explanations
terms related to the act of discovering or the concept of discovery itself
Negative Logits
Token        Logit
I            -0.73
отношению    -0.72
T            -0.69
A            -0.67
S            -0.64
R            -0.63
             -0.63
M            -0.60
espan        -0.60
rekli        -0.59
Positive Logits
Token        Logit
discoveries  1.85
discovery    1.82
Discover     1.66
Discovery    1.66
DISCOVER     1.56
Discovery    1.54
discovers    1.48
discover     1.48
discovery    1.46
discover     1.46
Activations Density 0.060%