INDEX
Explanations
verb forms in various tenses, indicating actions or states
Negative Logits
Vers         -0.72
SPD          -0.69
Notting      -0.65
Benn         -0.64
Dunk         -0.64
dismant      -0.63
predicate    -0.62
Hust         -0.62
millenn      -0.61
juven        -0.59
Positive Logits
tics                  0.91
senal                 0.89
Ĥİ                    0.89
guiActiveUnfocused    0.86
rael                  0.80
hyde                  0.79
abella                0.76
ifles                 0.75
tis                   0.74
ws                    0.74
Activations Density: 0.306%