INDEX
Explanations
titular words in titles or phrases
titles or references to named entities and specific designations
New Auto-Interp
Negative Logits
ITNESS
-0.69
shapeshifter
-0.68
saline
-0.65
net
-0.65
Observer
-0.64
procedural
-0.64
76561
-0.64
gravity
-0.62
magnetic
-0.61
downhill
-0.60
POSITIVE LOGITS
Tit
1.11
itles
1.06
ename
0.85
resy
0.84
reon
0.84
iago
0.81
indal
0.80
arus
0.80
zek
0.80
orius
0.77
Activations Density 0.004%