INDEX
Explanations
words related to numerical quantities and measurements
instances of numbers and quantitative measurements
New Auto-Interp
Negative Logits
Grateful
-0.59
Wolfe
-0.55
Rivera
-0.54
Franks
-0.53
Ange
-0.53
Schr
-0.52
Floyd
-0.51
Walters
-0.51
Ruby
-0.50
Pokémon
-0.50
POSITIVE LOGITS
trak
0.85
oother
0.76
qus
0.73
arser
0.71
versa
0.69
iliate
0.68
theirs
0.68
ocre
0.68
thereto
0.68
anooga
0.66
Activations Density 1.512%