INDEX
Explanations
proper nouns or named entities
instances of numerical data or dates
New Auto-Interp
Negative Logits
destro
-0.86
hement
-0.80
helicop
-0.73
Jagu
-0.72
nesday
-0.69
undai
-0.67
favoured
-0.65
acknow
-0.63
abolish
-0.63
owicz
-0.62
POSITIVE LOGITS
Enlarge
0.83
Legendary
0.80
Overview
0.80
Temperature
0.76
PRESS
0.75
Includes
0.74
Volume
0.74
Updated
0.73
versions
0.73
ALT
0.72
Activations Density 0.214%