Explanations
descriptions or discussions related to real or concrete things or concepts
Negative Logits
Token       Logit
fu          -0.71
wich        -0.70
ashore      -0.67
Beware      -0.65
Carbuncle   -0.64
bay         -0.63
limit       -0.63
zy          -0.62
Caf         -0.62
oner        -0.61
Positive Logits
Token       Logit
ity         1.02
izations    1.01
ITY         1.00
isation     1.00
idad        0.97
izable      0.95
ities       0.89
ignment     0.86
isations    0.85
ité         0.83
Activations Density: 14.002%