INDEX
Explanations
occurrences of the word "ith" and its variations in context
New Auto-Interp
Negative Logits
########.
-0.51
Thanksgiving
-0.49
Iron
-0.48
Pneum
-0.46
marion
-0.45
MECHAN
-0.44
Iron
-0.44
calab
-0.44
escar
-0.44
PDATE
-0.43
POSITIVE LOGITS
ith
0.88
ITH
0.71
iths
0.67
observations
0.57
ithi
0.52
Exactos
0.49
épisode
0.48
nil
0.48
episodes
0.48
️
0.47
Activations Density 0.206%