INDEX
Explanations
the abbreviation "OD" at different levels of activation
references to specific university-related terminology or acronyms
New Auto-Interp
Negative Logits
Ki
-0.72
Pi
-0.71
Welsh
-0.68
Canter
-0.67
Hurricanes
-0.66
Panthers
-0.65
Pens
-0.64
Crest
-0.64
Wem
-0.64
Athena
-0.64
POSITIVE LOGITS
IUM
1.13
ouble
1.03
ependent
1.02
ocument
1.00
gins
1.00
CAST
0.98
OME
0.96
irect
0.95
OD
0.94
sense
0.93
Activations Density 0.009%