INDEX
Explanations
contractions of the form "verb + 's", typically indicating possession or omission of a letter
possessive constructions and questions about different subjects or topics
Negative Logits
velop     -0.70
oak       -0.62
igraph    -0.62
onent     -0.60
sha       -0.60
UME       -0.59
erers     -0.58
cember    -0.57
wards     -0.57
onz       -0.56
Positive Logits
happened     0.97
happening    0.88
gonna        0.87
transpired   0.84
pace         0.79
happ         0.75
gotta        0.73
Done         0.71
REALLY       0.68
done         0.68
Activations Density 0.047%