INDEX
Explanations
Latin characters with accent marks
numerical or coded representations, possibly index or reference data in a list format
Negative Logits
iott      -0.65
bert      -0.63
thirsty   -0.63
atten     -0.60
bringer   -0.59
bour      -0.59
marks     -0.59
veter     -0.57
downs     -0.57
sb        -0.56
Positive Logits
âĵĺ  0.91
³³³³  0.83
Introduced  0.80
================================================================  0.77
³³³  0.75
³³³³³³³³³³³³³³³³  0.71
NES  0.70
âķIJâķIJ  0.70
³³³³³³³³  0.69
052  0.69
Activations Density 0.128%