INDEX
Explanations
numerical measurements, potentially related to technology or scientific data
references to numerical values or data points
Negative Logits
Token        Logit
urus         -0.61
neighb       -0.59
temptation   -0.58
book         -0.58
gifted       -0.57
stagnation   -0.56
Chronicle    -0.56
entry        -0.56
ide          -0.56
hers         -0.55
Positive Logits
Token        Logit
00           1.27
%-           0.97
50           0.96
%            0.93
rpm          0.92
345          0.87
%,           0.86
ACP          0.86
nm           0.85
70           0.85
Activations Density: 0.108%