INDEX
Explanations
technical numerical data and references
numerical references, particularly percentages and years
New Auto-Interp
Negative Logits
gets
-0.84
maid
-0.70
isters
-0.69
grass
-0.68
unks
-0.67
wich
-0.67
meet
-0.67
arent
-0.66
ener
-0.65
uden
-0.64
POSITIVE LOGITS
Ñĭ
0.93
ropolitan
0.78
80
0.77
âĸĪ
0.74
%:
0.72
eers
0.71
chan
0.71
rpm
0.69
eenth
0.69
70
0.68
Activations Density 0.065%