INDEX
Explanations
names of individuals
proper nouns, specifically names and places
New Auto-Interp
Negative Logits
ERA
-0.84
================================================================
-0.83
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
-0.83
é»Ĵ
-0.82
ODUCT
-0.80
åŃ
-0.74
ãĥ¼ãĥĨãĤ£
-0.73
Colossus
-0.71
EngineDebug
-0.68
Progress
-0.67
POSITIVE LOGITS
Naj
0.96
wcs
0.90
akh
0.89
ees
0.87
ee
0.86
merga
0.84
daq
0.83
jar
0.83
eb
0.82
odies
0.82
Activations Density 0.014%