INDEX
Explanations
mentions of educational institutions and universities
New Auto-Interp
Negative Logits
myſelf
-1.15
itſelf
-1.12
+#+#
-1.03
themſelves
-1.03
Monfieur
-0.99
Shakspeare
-0.98
doubtnut
-0.97
himſelf
-0.97
يتيمه
-0.95
auffi
-0.95
POSITIVE LOGITS
University
0.72
,
0.61
College
0.58
University
0.56
.
0.56
an
0.54
"
0.52
university
0.52
<eos>
0.50
0.49
Activations Density 0.145%