INDEX
Explanations
references to third parties or third-party interactions
Negative Logits
consig      -0.56
warzys      -0.49
λεί         -0.48
iramente    -0.48
consign     -0.47
aspi        -0.47
baijan      -0.46
odyne       -0.46
belline     -0.45
ikan        -0.45
Positive Logits
Third       2.08
Third       2.05
third       1.97
third       1.96
THIRD       1.84
terceiro    1.73
THIRD       1.71
thirds      1.69
III         1.67
thirds      1.66
Activations Density 0.083%