INDEX
Explanations
punctuation and conjunctions within the text
Negative Logits
niana         -0.42
おきます      -0.41
olmak         -0.41
osomes        -0.40
gna           -0.38
Santana       -0.38
Hermans       -0.36
uzu           -0.36
tekem         -0.36
guapos        -0.36
Positive Logits
שוליים        0.55
ſta           0.50
ftate         0.50
(token not shown)  0.49
experts       0.49
مشين          0.48
according     0.47
ujednoznacz   0.46
argued        0.46
AppDelegate   0.46
Activations Density 0.373%