INDEX
Explanations
numbers related to quantities or specifications
numerical data and statistics
Negative Logits
¯           -0.74
hers        -0.72
theirs      -0.70
actionDate  -0.66
ours        -0.65
Scourge     -0.59
ebin        -0.58
iterator    -0.56
affair      -0.55
plin        -0.54
Positive Logits
countries   1.04
th          1.02
different   0.98
nm          0.95
%-          0.92
languages   0.91
%           0.89
nations     0.88
Countries   0.87
categories  0.86
Activations Density 0.168%