Explanations
links or references to URLs
elements related to technical specifications and quantitative details
Negative Logits
Token      Logit
ÂŃ         -0.84
Enlarge    -0.79
toggle     -0.63
Adolf      -0.59
uble       -0.58
Canaver    -0.56
ÂŃ         -0.54
½          -0.53
chanted    -0.53
century    -0.53
Positive Logits
Token           Logit
thru            0.81
independ        0.78
devs            0.75
CoC             0.75
alot            0.74
doesnt          0.74
didnt           0.74
tho             0.73
aforementioned  0.70
dont            0.69
Activations Density 1.875%