INDEX
Explanations
phrases related to personal awareness or realization
pronouns and their use in conveying personal beliefs or experiences
Negative Logits
aston       -0.78
haps        -0.70
noon        -0.67
phans       -0.65
elia        -0.62
bender      -0.62
Mont        -0.62
quartered   -0.60
hattan      -0.59
WHO         -0.58
Positive Logits
wrought     0.86
happ        0.84
happened    0.81
happen      0.81
learnt      0.79
happens     0.75
've         0.74
wanted      0.74
learned     0.74
'd          0.72
Activations Density 0.133%
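
The density figure above is not defined on the page itself. A minimal sketch, assuming it means the fraction of sampled tokens on which the feature's activation exceeds zero; the function name, the threshold parameter, and the toy values are hypothetical and are not taken from the dashboard:

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation is above `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The dashboard does not state its exact definition.
    """
    if len(activations) == 0:
        raise ValueError("activations must contain at least one value")
    # Count tokens where the feature is active, then normalise by sample size.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy values for illustration only: the feature fires on one token out of four.
toy_activations = [0.0, 0.0, 0.0, 1.5]
print(f"{activation_density(toy_activations):.3%}")
```

Under this reading, a density of 0.133% would correspond to the feature firing on roughly 1 token in 750.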