INDEX
Explanations
dialogue and conversational phrases
introductory like, and gotta/wanna
New Auto-Interp
Negative Logits
HasFactory
-0.48
ьажоргаш
-0.46
NSCoder
-0.44
GEBURTSDATUM
-0.42
cektir
-0.42
coloro
-0.42
organisations
-0.40
🔰
-0.40
OMITTED
-0.40
כלל
-0.40
POSITIVE LOGITS
WithIOException
0.65
GONNA
0.61
Gonna
0.55
gonna
0.55
gotta
0.54
gonna
0.54
outta
0.53
Wanna
0.52
yeah
0.52
wanna
0.52
Activations Density 0.012%