INDEX
Explanations
proper nouns such as names of institutions and people
specific roles and title attributes related to various professions and measurements
New Auto-Interp
Negative Logits
.)
-0.72
.):
-0.71
Parameters
-0.62
.)
-0.59
iatus
-0.57
Mos
-0.56
Ange
-0.55
DEN
-0.55
WAR
-0.54
pard
-0.54
POSITIVE LOGITS
etc
0.73
captcha
0.69
ilion
0.67
utils
0.65
enture
0.59
cheat
0.57
ModLoader
0.57
vertisement
0.56
Tumblr
0.55
inclusion
0.55
Activations Density 0.928%