|
EVALUATING AMERICAN INDIAN EDUCATION
By
Thomas (Tom) R. Hopkins, Ed.D.
© 2003
Research and Management Specialists, Inc
4106 Tara Ct. NE
Albuquerque, NM 87111
January 2003
Introduction
Evaluating American Indian education requires the best of professional
educators, especially as related to the principles of fairness
and accuracy. The contemporary situation in American education
is dominated by high stakes testing and state level accountability
programs in which American Indian education receives, mostly, unfair
treatment based on culturally biased, inaccurate test data. However,
it is possible within high stakes testing and state accountability
programs to provide fairness and accuracy to American Indian communities.
We hope in this article to suggest to evaluators how this can be
accomplished.
The 2000 Decennial Census (2002) reports 4.1 million people who
self-reported being American Indian or part American Indian. The
4.1 million is 1.5 percent of the total U.S. population. 2.5 million
of these were American Indian or Alaskan Native only, which is
.9 percent of the U.S. population. An additional 1.6 million reported
being part American Indian. The BIA provides services to “American
Indians” who are 25 percent or more Indian by blood quantum.
In this respect, “American Indian” is a legal term
which applies to the BIA (and the U.S. Public Health Service Division
of Indian Health) and its programs including over 200 tribes on
reservations which encompass over 50 million acres of land. The
BIA Office of Indian Education Programs (OIEP) web site describes
its Indian education base as: “OIEP has responsibility for
185 elementary and secondary schools and dormitories as well as
27 colleges. Our colleges, schools and dormitories are located
on 63 reservations in 23 states across the United States serving
approximately 60,000 students representing 238 different tribes.”
Nationally, the American Indian population is complex, often
confusing to non-Indians and offers education evaluators a great
challenge to identify with reasonable accuracy who would be included
in an evaluation. Nonetheless, there have been several national
evaluations of American Indian education.
American Indian Education
Evaluation, An Historical Approach 1
For purposes of this paper, evaluation is defined as providing
valid education information to decision makers. Evaluators don’t
make administrative decisions. Evaluation differs from research
primarily in the area of control of the education environment.
Research usually involves a very limited number of subjects within
a tightly controlled environment investigating strictly defined
interventions. Even though evaluations are large in scope with
limited control of the environment, they produce valid information
on which decision makers can make decisions. In several instances,
education research reports will be included in the literature.
In developing an historical approach to American Indian education
evaluation it is appropriate to start with the first quarter of
the twentieth century. We start at this time as it was during the
first quarter of the twentieth century that American educators
began to apply scientific methodology to education. Jerome V. D’Agostino
(2000) tells us that something similar to standardized tests were
first introduced in Boston in 1845. During the nineteenth century
scientifically developed education instruments were not applied
to American Indian education. It was during the first quarter of
the twentieth century that scientific instruments were applied
to American Indian education. D’ Agostino also points out
the importance that Ralph Tyler and “Progressive educational
reformers,” played important roles in early twentieth century
testing. We will learn that Tyler also influenced American Indian
education evaluation. Ironically, Robert Salvin (2002) in a recent
article espouses the belief that, at last, scientific methodology
in education will drive transformation of educational practice
and research in the twenty first century. Salvin claims that science
has yet to dominate U.S. education processes.
The application of scientific methodology, borrowed from the
natural or hard sciences such as physics and chemistry and the
employment of advanced mathematics, is not a bad development. The
application of science to education has been a creative and intellectually
respectable contribution to education. Scientifically developed
education instruments, like all science, requires specificity and
intellectual rigor. It focuses thought and education activities
in a very specific manner. Scientific education instruments are
usually based on a specific target population such as the total
U.S., college level students, etc. If applied to that population
they can be helpful to professional educators working in schools
specific to the targeted population. On the other hand, achievement
tests have historically been developed for the U.S. as a whole,
but have reflected only the Eurocentric2 based
culture and related population. Without developing special norms
for sub-cultures, such as Navajo, Eskimo, Sioux, etc. achievement
tests produce inaccurate scores.
Preliminary, Background Education Research
Robert Havighurst (1957), the noted education researcher at the
University of Chicago, contributed enormously to the education
research on American Indians. His article in the 1957 issue of The
Annals is a good place to start. In this article Havighurst
refers to a 1928 research project on intelligence. The following
paragraph makes the point of cultural fairness and standardized
instruments.
Klineberg administered the Pintner-Paterson
series of six tests to Indian and white children on the Yakima
Indian reservation and found (1) that Indian children took longer
with form boards but made fewer errors, (2) that comparison of
Indian and white groups in terms of total number of points obtained
on the Pintner-Paterson Point Scale showed no differences between
the two because the Indians made up in accuracy for their inferior
speed, and (3) that correspondence of score with degree of white
blood was lacking. Whereas preceding investigations pointed to
the superiority of the whites over the Indians on tests of intelligence,
Klineberg’s study is among the first to offer contradictory
evidence and to suggest that test performance may be affected
by cultural factors.
Klineberg published his article in Archives of Psychology,
1928.
Havighurst also includes intelligence test results as related
to American Indians. Said tests developed for Eurocentric, English
speaking populations, have not produced reliable American Indian
IQ’s. However, the several non-verbal tests, including the “Goodenough
Draw-A-Man Test,” used since 1926, compare Indian children
with non-Indian children in a more equal and fair manner. In the
early 1940’s Havighurst did intelligence research on Navajo
children. He had this to say about his work.
The existence in the Indian's experience
of parallel but different cultures is illustrated by a study
made in the early 1940's of Navaho Indian children having different
degrees of contact with white culture in two different communities?
The Shiprock community is located on a well-traveled highway
and has substantial economic relations with neighboring white
communities. This was probably the most acculturated of Navaho
communities in the early 1940's. In contrast, the Navaho Mountain
community was probably the least acculturated. Situated far away
from any highway, in an area almost never visited by white men.
Many of the Navaho children had never seen a white person except
the school teacher: and many of the children never attended school.
A few of the older children attended a boarding school in Tuba
City, which was on a secondary highway, but still relatively
apart from the world of white Americans.
The Shiprock children were closer to the
average of white American children on a number of tests of attitudes
and abilities than were the Navaho Mountain children, but still
the Shiprock children showed many significant differences from
the white children, indicating the persistence of the Indian
culture in their lives (pp. 107-108).
Over 60 years later, things haven’t changed at all regarding
the use of intelligence tests that Havighurst reported. Havighurst
benefited from the BIA’s national Education Program office
Chicago location. In 1941 the “Indian Education Research
Project,” was initiated at the University of Chicago by the
BIA (Zintz, 1969, pp. 131-132.).
Perhaps one of the earliest and most in-depth study of one American
Indian education group was conducted by two Stanford University
researchers, Dewey Anderson and Walter Eells (1935). They spent
two years trekking throughout Territorial Alaska applying standardized
tests of achievement and intelligence to Eskimos, Indians and Aleuts.
Frustrated with the dramatic difference between Alaskan Native
language, cultural and social conditions and the Stanford Achievement
test or the Stanford-Benet Intelligence Test, the researchers attempted
on-the-spot modifications in order to develop different data. It
was interesting in that a main source of information on Alaskan
Natives was mostly provided to the researchers by local non-Natives,
many of whom operated isolated trading posts. It is important that
standardized evaluation instruments provided only minimally useful
results. The most important finding of this massive work was that
Alaskan Natives were (and are) very different culturally from the
norming groups of the instruments, consequently, standardized instruments
and related statistics based on norming groups, should be used
on American Indians with great caution.
Up until about 1946, the BIA headquarters office for its national
education program was located in Chicago. It is not surprising
that the University of Chicago played an important role in evaluating
American Indian education. With the advent of the famed Meriam
(1928) report and the change in administration to Franklin Roosevelt,
important changes in American Indian education were implemented.
Roosevelt appointed John Collier as Indian Commissioner (Philp,
1981), who appointed Progressive Educator Edward Everett Dale as
his Education Director. Dale began a long influence on BIA Education
by the Progressive Education movement (Goodenow 1980). When Dale
left the BIA another Progressive Educator, Willard Beatty (Squire
1972, p. 214), was appointed as Education Director. Collier, who
espoused “Social Engineering,” (Boyce, 1974) and his
Progressive Education Directors of Education were partial to evaluation
data to prove effectiveness of Federal schools compared to public
schools.
Four National Evaluations, 1944 - 1970
In the contemporary standards based education reform for most
states, standardized tests are used in ways the test developers
say they should not be used. Standardized tests cannot measure
a total education program which includes affective and cognitive
domains. Further, to produce accurate evaluation data on a reform
program the curriculum content, teacher education and assessment
(tests) must be aligned (Schoenfeld, 2002). This is a simple, age-old
education principle which says that the content taught children
determines the content of the tests and that teachers are educated
to teach said curriculum content.
There is an extensive literature on evaluating American Indian
education most of which has been produced by serious and competent
professionals. There are three components of each evaluation that
lead us to our conclusion about fairness and accuracy in American
Indian education evaluation. First, each publication reflects a
purpose which we find is basic to the quality of the evaluation.
Second, though there has been less attention to who was included
in the evaluation, there was at least some statements defining
the American Indian population being evaluated. Third, methodology
while not receiving the discussion it deserved, was reasonably
reflected in the instruments and analyses.
There were four national evaluations of American Indian education.
All four were paid for by the BIA and had a major purpose of comparing
the relative effectiveness of the BIA education program schools
compared to public and mission schools and Indian and White students.
The first was contracted to the University of Chicago, the next
two to the University of Kansas and the fourth conducted without
university assistance, was contracted to the Southwest Regional
Laboratory, a Great Society creation. There was continuity between
the four evaluation that was provided by L. Madison Coombs, an
employee of the BIA’s Education office. Coombs was a major
role player in BIA education evaluation and research from about
1950 to 1970. The general design of the four evaluations was set
by the first evaluation. Because the first evaluation more or less
established the basic design for all four, it will be discussed
in more detail and its design used to describe the following three.
The Peterson Evaluation, Number One
A first national evaluation of the education of American Indians
(American Indians) was contracted by the BIA to the Education Department
of the University of Chicago. This evaluation started with a “Pilot
study” in 1944 and gathered data in 1945 and 1946. Shalier
Peterson (1948) in the Preface mentions that “Mr. Willard
W. Beatty, Director of Education for the Office of Indian Affairs,
consulted with Mr. Ralph W. Tyler, Chariman of the Department of
Education of the university of Chicago and asked the university
to provide assistance in planning this project, in guiding its
execution and in analyzing the data gathered.” Tyler also
contributed a section (pp. 113-117) to the book. Among others,
Hilda Taba was a Research Associate and Assistant Professor at
the University. It should come as no surprise that the BIA Education
Program had well defined objectives. Peterson quotes from the Civil
Service examination:
“The primary objectives of Indian
schools are: To give students an understanding and appreciation
of their own tribal lore, art, music, and community organization;
to teach students through their own participation in school
and community government to become constructive citizens of
their communities; to aid students in analyzing the economic
resources of their reservation and in planning more effective
ways of utilizing these resources for the improvement of standards
of living; to teach, through actual demonstration, intelligent
conservation of natural resources; to give students firsthand
experience in livestock management, use of native materials
in housing and clothing, in subsistence gardening, cooperative
marketing, farm mechanics, and whatever other vocational skills
are needed to earn a livelihood in the region; to develop better
health habits, improve sanitation, and higher standards of
diet with a view to prevention of trachoma, tuberculosis, and
infant diseases; to give students an understanding of the social
and economic world immediately about them and to aid them in
achieving some mastery over their environment; and to serve
as a community center in meeting the social and economic needs
of the community.” (p. 9)
Peterson states the purposes of the evaluation as:
The Indian Service-wide Testing Project,
reported in this monograph, had two main purposes: 1) to examine
the progress and achievement that the Indian students had mode
in various types of educational situations; 2) to examine those
factors which were thought to be related to the student’s
educational development and to uncover any other factors which
might prove to be related.
As pointed out above, the BIA was the major education institution
providing schooling to Indians on reservations. The purpose of
the evaluation was generally, in line with the famed eight year
study of progressive vs traditional education. The “New Deal” BIA
had instituted a new bicultural program, which was referenced above
in the purpose (Szasz, 1977). The evaluation question was, “Did
Indian students exposed primarily to the new program do better
than those who were not exposed to the program?” As Tyler
in his section reports:
The tests results show that children in the
fourth grade who have had their total schooling under the new
program have made relatively greater progress than pupils in
the 8th or 12th grades who have had but a part of their schooling
under the new program. This is true in each of the major fields,
reading, writing, arithmetic, natural resources and health and
safety. More specifically, the children coming through the new
program gain not only knowledge of matters related to life in
the Indian home and community but at the same time acquire greater
competence in the tools of learning. By dealing in school with
matters of interest to Indian children that they can actually
use in their own lives, not only do they learn about these things
but they acquire greater skill in reading, writing and arithmetic
as these abilities are used in study of matters that are interesting
and helpful to them. (p. 116)
Purpose. The purpose of this evaluation was
to evaluate the BIA Education program and make comparisons between
BIA and other schools in order to determine if the BIA progressive
education program was as good as other programs. This was a large
scale evaluation intended for use by administrators and policy
level officials.
Indian Students. The Indian students included
in the evaluation attended BIA, public and mission schools. Students
in grades 4, 8 and 12 were evaluated. Indian students met the BIA
legal requirement of being “Indian” with 25 percent
or more by blood quantum. The report is very detailed in the instruments
used for each grade level. There were 5,324 students attending
six different types of schools. This was a “National” evaluation
in that it evaluated the BIA Education program nationally. It did
not include all American Indian students attending schools throughout
the U.S. While the evaluation was longitudinal, 1944 – 1946.
Students were not tracked individually. The report states, “In
interpreting the tables and charts, it is necessary to keep in
mind that these data describe three separate groups of students
instead of the same students at three periods.” (p. 29)

Figure 1 Peterson Data |
Curriculum. At that time, 1946, the U.S. Indian
Service (USIS) had specific curriculum guides. Its Education purpose,
as described above, was clearly stated. Evaluators tried to align
tests with some aspects of the curriculum. Instrumentation. Fourteen different evaluation
instruments were used. Three of these were standardized tests two
pertained to background information and five special instruments
which were based on the USIS curriculum. Scientific methodology
was used throughout. Chapter Two contains details on instrumentation.
The USIS instruments as well as the “Free Writing” test
samples are printed at the back of the report.
Background Data. Test data was linked to the “Background
Questionnaire.” Background data was very important to interpretation
of test results. Chapter Three, “Differences in Student Backgrounds,” provides
a detailed discussion of the links to test data, including cultural
backgrounds of students. It is interesting that the evaluators
of 1944 thought test scores should be related to background characteristics.
Indian Blood quantum. As mentioned above, blood quantum is a
legal term related to attendance at BIA funded schools. It was
assumed in the Peterson evaluation that the greater the degree
of Indian blood, the closer the individual was to the Indian culture.
The report states, “The data do not justify a conclusion
that school success is limited by degree of Indian blood.” (p.
29)
Statistics. Raw scores from standardized tests were used to illustrate
relationships between scores and background characteristics. Median
scores were illustrated graphically as was the range of raw scores.
In fact, medians were used most often in reporting raw test scores.
Number of students on which the graphs were based were also included.
Percentages were used in reporting background characteristics.
Range of data were also presented with a three part division between
the bottom third, the middle third and the top third for each instrument.
Figure 1, above, is from the Peterson report.
Test Norms. Test norms, or “National Norm Equivalents” were
reported. This is nowadays referred to as “Grade Equivalents.” It
is interesting to note the comment about the use of test norms
with Indian students.
The "National Norm Equivalents" have not been provided
with the idea of suggesting any levels of attainment that are
expected. National norms are commonly based upon either urban
school children
or, in some cases, a combination of urban and near-urban education.
In view of this, it is obvious that national norms would
not be the proper yardstick to use as a means of measuring or
comparing
achievement in this Indian education study. (emphasis
added)
It
should be repeated here that the students tested in the Indian
evaluation study all come from a rural background and were taught
in schools having a rural environment, and therefore these students
should not be compared with the urban students, whose environment
and curriculum are recognized to be quite different. The national
norms do help to provide some concept of the range of abilities,
and they also give one an understanding of the increments between
grade levels in the urban school population. The national norms
show the achievement of the rural students to be, in some instances,
much higher than one might expect. (p. 59)
Interestingly enough, the general rule expressed above about
the 1944 – 46 evaluation is just as valid today as then.
The graphic presentations juxtaposed, side by side, raw score data
and test norm data. The comparisons are interesting.
The Anderson Evaluation, Number Two
The second national evaluation was conducted by Anderson, Collister
and Ladd (1953) at the University of Kansas. By the time this evaluation
started Madison Coombs, a former Kansas county school superintendent,
had joined the BIA Education program and was stationed at Lawrence,
Kansas on the campus of the then Haskell Institute, now the Haskell
Indian University. Coombs was instrumental in selecting Anderson
to lead the evaluation and was also instrumental in setting the
design for the evaluation. Coombs is acknowledged in the report
frontispiece. It borrowed a great deal from the University of Chicago
evaluation.
- The University of Chicago and Ralph Tyler were referenced
as providing the background to the evaluation.
- The same
purpose and philosophy as stated by Peterson were used by Anderson.
- The
same instruments were used as much as possible.
- Background
characteristics were important to interpretation of data.
- It
was assumed that the same education program as described in
the earlier study was still in effect.
- In 1950 Coombs established a “Service wide Testing
program,” which provided data to Anderson. Data from the
1950 administration, only, were used.
- Students were not
tracked
- Comparisons were made to the Chicago data.
Statistics. Since computers were used with data analysis different
statistical analysis were also used. The Standard Deviation of
the Mean was the main statistic used by Anderson. Statistical significance
was also used as were “F” values and “t” ratios
and the “Behrens-Fisher “d” test. In fact, Chapter
III is titled, “Statistical Treatment of the Test Data.” Further,
Chapter IV is titled, “Results of the Statistical Treatment
of The Test Data.” “Raw or Untreated Scores” were
used for statistical procedures. The first paragraph in Chapter
IV explains the treatment of data.

Figure 2 Anderson Data |
The results of the statistical treatment of the test data* obtained
by the administration of twelve tests to the eighth grade and twelve
tests to the twelfth grade will be discussed under three headings:
(1) comparisons of achievement by means of line graphs for the
nine geographic areas and for the six types of schools for grades
eight and twelve, (2) comparisons of achievement by means of the
percentage of overlap between grades eight and twelve where the
tests were common to both grade levels, and (3) comparisons of
achievement by means of critical ratios for the types of schools
for grades eight and twelve. These three types of comparisons were
made in order to provide realistic answers to the basic questions
stated in Chapter II. (p. 34)
* Raw or untreated scores.
Figure 2 illustrates Anderson’s graphic illustrations of
test data. Anderson’s more sophisticated statistics limited
dramatically communication of findings to program and policy officials
By the time the Anderson report was published, 1953, Willard
Beatty had left the BIA and been replaced by Hildegrad Thompson.
Thomspon did not have the sterling Progressive Education credentials
as Beatty, but she appeared to follow the philosophy of progressive
education.
The Coombs Evaluation, Number Three
The Coombs evaluation reflects a clear departure from the previous
two evaluations. The previous two evaluations cite the 1928 Merian
report as a benchmark for assessing the education of American Indians.
Coombs also cites the Merian report. However, the subtitle to Coombs
evaluation is, “A Study of Interracial Differences.” Basically,
Coombs changed the focus of the evaluation away from assessing
the effectiveness of the BIA education program as compared to others
to assessing , “Learning,” and achievement of Indians
and whites as measured by achievement tests. In this study the
California Achievement Test was used, which is another departure
from the previous two studies which tried to maintain the integrity
of test instruments between evaluations. In this respect, the Coombs
evaluation becomes almost a new approach to evaluating American
Indian education.
Basic policy regarding the education of American Indians is set
by the Congress. Lyman Tyler (1973), a history professor at the
University of Utah, wrote a book on Indian policy and the Government
published it. During the 1950’s, the period in which the
Coombs evaluation took place, the policy of the Federal Government,
as determined by the Congress, was to “Terminate” Federal
services to Indians. Regarding education, Termination meant turning
over BIA schools to the public schools. It should be remembered
that the previous two studies took place under the influence of
the “New Deal” and the progressive education movement.
With Coombs the foundation of policy shifts to something titled, “Inter
racial.”
Another very important feature of the Coombs evaluation was that
the overall design and work was done “In House.” This
meant that Coombs, as a Civil Servant, an employee of the BIA,
administered and controlled the evaluation, including the writing
of the report. The professors at the University of Kansas, who
shared authorship, provided technical assistance in the form of
data entry and statistical analysis. Otherwise, this was an “In
House” evaluation. The two previous evaluation were administered
by higher education institutions and professors with the BIA providing
support. The Coombs evaluation reversed the roles of the Government
and higher education, and it is reflected throughout the report.
Willard Beatty left the BIA and Hildegard Thompson took his place
with the responsibility to implement Termination.
Major Characteristics of the Coombs Evaluation
Data Gathering. The two previous evaluation were very clear and
explicit about who was included in the evaluations and when the
data was gathered. Coombs does not reflect such a clear statement
about data gathering. But, when one digs deeper it comes as no
surprise that data gathering was controlled by the BIA field administrators.
While Coombs reports a total of 23,608 and the breakdown by racial
groups and schools, he was not clear on the time data was gathered.
Background characteristics were included in the data structure.
Time and Data Organization. Coombs had to allow the field administrators,
Agency School Administrators, to control who was to be included
in the evaluation as well as when data was to be gathered. The
BIA, starting with the Termination period, was divided into “Area
Offices,” and it was these offices that determined that no
total BIA administration of tests would take place during the 1951 – 1954.
Without knowing the organizational structure of the BIA, one could
think this was a longitudinal study, which it was not. Table 3-a
(p. 18) provides the administrative areas and the number of “pupils” tested
in each. There were six administrative areas at the time of the
evaluation and all were tested in the fall of a year. While the
evaluation program was initiated in the fall of 1951 and continued
through 1954, each administrative area was tested only once in
the fall of a year.
Grades Tested. Grades four through twelve were included in the
evaluation whereas previous evaluation tested grades four, eight
and twelve. The rationale for this was one of the purposes of the
evaluation which was to improve learning and instruction at the
local levels. Grades one through three were excluded because there
did not seem to be a good achievement test available for the primary
grades.
Evaluation Results. Coombs, like the other evaluations,
is balanced in the discussions regarding test results, race and
Indian cultures.
He reports that, according to test scores, whites make higher scores
than Indians. He, like the others, includes important discussions
of age-grade results. Indians, because of language spoken in the
home, were more often than whites, older for their grade placement.
Pupils were old for their age and made lower test scores. Coombs
established an “Order of achievement
of Race-School Groups,” which was:
1. White children in public schools
2. Indian children in
public schools
3. Indian children in Federal schools
4. Indian children in
mission schools (p. 40)
Chapter VI is an excellent discussion of cultural and environmental
factors on achievement. Blood quantum is again discussed but found
not to be a basic factor regarding achievement. Coombs also made
note of test scores and progress through the grades wherein Indians
fall further behind whites as the progress through school.
Predictive Use of Tests. Coombs devoted considerable space to
using tests to predict educational outcomes of Indians. Since there
were no longitudinal data on which to base conclusions, his discussion
was speculative. Haskell vocational program which enrolled high
school graduates wanted to use test scores to predict probable
success.

Figure 3 Coombs Data |
Statistical Procedures. Because of the importance
of administrative areas in the evaluation, statistical procedures
accommodated the
politics of them. Coombs developed a “Composite Norm.” for
the different administrative areas. It was based on means.
The composite norms were established by calculating the raw
score mean and standard deviation for the composite group on
each skill,
and total score, for each grade level. Raw score values were
then fixed at the mean and plus-and-minus one standard deviation
from
the mean for each grade. Assuming that the distributions are
normal, approximately 68 percent of the cases fall between the
raw scores
at plus-and-minus one standard deviation from the mean; about
16 percent fall above plus 1 S.D. and the remaining 16 percent
below
minus 1 S.D. These three strata of achievement are shown in the
scale at the top of each figure and are designated as “average,” “below
average,” and “above average.” This scale serves
as a standard for comparison.
The graphic presentation of data is shown in Figure 3. T scores
are also figured on raw scores. Almost as an after thought, Appendix
D, presents additional statistical analyses on T scores level of
significance. Obviously, statistical procedures other than means
would not go over well with the area administrators.
The University of Kansas assisted Coombs with database management
and statistical procedures..
Bass Evaluation, Number Four
Willard Bass (1971) , working at the Southwestern Cooperative
Educational Laboratory, a creation of Lyndon Johnson’s Great
Society, conducted the fourth and final evaluation in the series
that started at the University of Chicago in 1944. It is remarkable
in American education that we find some continuity that was initiated
with the Meriam report in 1928 and was concluded with the Bass
evaluation. It is suggested that this little known continuity which
drew heavily from progressive education’s concern for effectiveness
of school types, is a remarkable U.S. education phenomenon.
As much as any one individual, Madison Coombs was responsible
for this remarkable continuity. He received no credit whatsoever
for making the continuity possible. Coombs picked up on the Chicago
evaluation when he joined the BIA staff at Haskell and worked with
Anderson on his evaluation. Anderson makes clear, unambiguous reference
to the Chicago evaluation. Then, Coombs and his evaluation, an
internal evaluation conducted by BIA from 1951 – 1954, continued
evaluating school types. Bass makes clear the role of the Coombs
evaluation as a benchmark for his evaluation. Regarding the Anderson
evaluation and the Bass evaluation, within the BIA, Coombs was
the “Contract Officer’s Representative,” which
means that he was the main Education officer overseeing the evaluations
from within the BIA. The Bass evaluation therefore reflects the
last study of an evaluative concern for the effectiveness of schools
types that was initiated in 1928 and continued through to 1970,
a period of 42 years.
Structure and Content of the Bass Evaluation
Research Design and Approach. In contemporary understanding of
education research, the Bass evaluation was designed and conducted
with disciplined strictness. It employed sophisticated statistics
on a sample of students who attended a sample of schools. It used
several scientific instruments including the California Achievement
Test and the California Test of Mental Maturity as well as several
others.
Individual Differences. “Background Characteristics” were
included in all the evaluations and Bass refers to them as “Individual
Differences.” Achievement between schools was controlled
and achievement according to individual differences was also controlled.
While there were no statistically significant differences between
types of schools, there were also no differences in school effectiveness
when the same types of students were compared. A low achieving
students who had common background characteristics in a public
schools made no higher achievement than a low achieving student
in a BIA on-reservation school.
Using a Sample of Schools. Bass used only a selected, not a random,
sample of the different types of schools. This placed the entire
evaluation within the world of probability. The previous evaluations
evaluated hundreds of Indian children and youth whereas Bass selected
only a sample that would reflect the administrative areas of the
BIA, which was in keeping with the Coombs structure.
Using a Sample of Students. A stratified sample of 3,375 students
in grades nine through twelve were included in the evaluation.
The students were tracked from year to year. In the spring of 1970,
837 students were tested. It is a safe assumption that the American
Indian students met the BIA blood quantum requirement inasmuch
as Madison Coombs selected the schools for Bass so they would reflect
BIA administrative areas.
Longitudinal Evaluation. The Bass evaluation started in the fall
of 1966 and continued through to the spring of 1970. A “Pre” and “Post” test
was administered each fall and spring for four school years, 1966-69
through to 1979-70.
Secondary Education. Bass evaluated only high schools, grades
nine through twelve. Perhaps as much as any part of the evaluation,
no discussion was made of the history secondary education of American
Indians. Historically, the BIA high school curriculum and instructional
day were divided equally between vocational and academic instruction.
Enrollment in a vocation was required of each student. Academic
instruction was linked to the vocational curriculum that dominated
high school education from the nineteenth century through about
1957. In 1957 the BIA started changing its high school curriculum
by dropping the vocational requirement and implementing a stronger
academic curriculum.
No mention was made of the lack of secondary school facilities
available especially to Navajos and Alaskan Natives. For example,
as late as 1964 the only high school available to Alaskan Natives
was Mt. Edgecumbe, a boarding school located at Sitka. Further,
only a scattering of high schools were available to Navajos. The
Fort Wingate High School for Navajos, a boarding school, was not
completed until 1967, which was in the middle of the Bass study.
Otherwise, Navajos attended public high schools located on the
Navajo Nation lands only to the extent that they were being constructed.
Regarding Navajos, the lack of elementary schools on the reservation
severely impacted negatively secondary education numbers. The effect
of this condition was that there were literally thousands of American
Indians denied a high school education simply because there were
no elementary or high schools for them to attend. Obviously, had
attention been given to the high school age students not in any
school, a modified finding on achievement would have been reported.
There is a pattern in the history of evaluating American Indians
to include only those children and youth in school. If for some
reason a student is not attending a school they are excluded from
the evaluation. A focus of the evaluation on school types and achievement
says nothing about the children and youth who leave school during
the six though nine grades in large numbers.
Instruments. In addition to a questionnaire, which was included
in all four evaluations, Bass used the following instruments:
- California Achievement Test raw scores to measure
achievement
- California Test of Mental Maturity to used
as a scholastic aptitude control
- Monde Problem Check List
- School Interest Inventory
- Semantic Differential on ten
concepts, seven point scale
- California Psychological Inventory
- Vocational Aspiration
Scale
- Self Concept
Statistics. Means were developed from raw scores and analysis
of covariance were run. Tests of significant differences were run.
Bass succinctly states the conclusion of his evaluation:
In the light of the above findings it can confidently be concluded
that when initial individual differences in scholastic aptitude
and academic ability were controlled there was no evidence that
academic achievement of American Indian students was greater in
one type of school than in another. (p. 81)
Bass did not include graphic illustrations of data but did provide
numerous tables. A summary type table follows.

Observations from the Four Evaluations
It can be said that the four evaluations, all of which were funded
by the Federal Government’s Bureau of Indian Affairs, reflect
seven decades of common school education of American Indians, 1900 – 1970.
Some general observations from the evaluations can be made.
- The application of science to education through standardized
tests and other instruments was basic to all the evaluations.
None of the evaluators thought the direct application of test
norms
was fair to Indian students which resulted in the use of test
raw scores in all four evaluations. Test norm scores were reported
in all, but only as a minor observation. When test norms were
used
they reflected near grade level at grade four with a gradual
falling below norms until in grade twelve where Indians were
two years
behind. This general test norm pattern continues to this day.
- At
first, a comparison of progressive education as applied to
the education of American Indians was the purpose of the evaluations.
Schools were compared to schools. While the progressive program
was dropped starting with the Coombs evaluation, comparing schools
was continued.
- The statistical procedures used in all four
evaluations seemed to reflect the professional evolution of education
evaluation.
The 1944 evaluation started with means and the Bass, 1970 evaluation
used more sophisticated statistics. Using means of raw scores
was basic to all statistical procedures.
- Students involved in all the evaluations varied from
thousands to hundreds. All students were considered eligible
for education services from the BIA and as such, were legally “Indian.” This
means that all were one quarter or more Indian by blood quantum.
Blood quantum was used as a variable in three of the evaluations.
Bass did not use blood quantum as an important variable.
- The
education levels ranged from grades 4, 8 and 12 to all grades
to only high school. In this respect, there was inconsistency
in grades included in the evaluations.
- In 1944 the BIA had
a well defined curriculum which was tested with special instruments.
Anderson, the second evaluation,
also used USIS curriculum based tests. Starting with Coombs,
which was at a time during which major BIA high school curriculum
changes
were taking place, the BIA curriculum was not tested. Alignment
between standardized tests and curriculums was not in evidence
in any of the evaluations.
- These were all large scale
evaluations and were not considered useful at the classroom or
local levels. Coombs did mention that
he wanted his evaluation to be useful at the classroom level, but
no follow-up on usefulness was included in the report. Hopkins
(2002, Op. Cit.) has reported that the evaluations were not useful
at the classroom level. There is no evidence that the evaluations
impacted basic policy set by the U.S. Congress. There is no evidence
that Indian tribes used the evaluation results. It is suggested
that the evaluations were most interesting and useful to the academic
community, only.
- The basic evaluation design, which was
established by Ralph Tyler, is as useful today as when he implemented
it in 1944.
The overall evaluation projects reflect very professional and
useful large scale evaluation design, instruments, students,
statistics
and reporting for any population not to mention American Indians.
- The age-grade placement of Indian children remained
at least a year out of step with normal grade placement. This
was
discussed as emanating for the speaking of tribal languages in
the home. The first year of school was devoted to teaching children
English as a second language and by large numbers, Indian children
didn’t enter first grade until age seven. This had started
to change by 1970 but was still present for most of children
schooled by the BIA. Age-grade placement was also pointed to
as an explanation
for lower achievement as compared to whites.
1970 – 1979 Evaluations
The National Study of Indian Education (NSIE). There were no
large scale evaluations which took place from the Bass study culmination
in 1970 through the next 30 years. Perhaps the four studies and
the 30 years of Indian education covered therein established predictable
standardized test results. Late in the 1960’s, which was
during the time public schools assumed major responsibility for
educating American Indians, there emerged new concern for the well
being of American Indians, including their education. This concern
was motivated by Senator Robert Kennedy’s study of the education
of Indians (U.S. Senate, 1969). The Kennedy study generally described
the education of Indians as being in a sad condition. It is important
that the major contribution of the Kennedy study was creation of
and establishment of the Office of Indian Education in the U.S.
Department of Education. This office approaches Indian education
on a scale similar to that of the Decennial Census definition of
Indian. Within the Government, it was decided that a very large
research project should be conducted. This study became the NSIE
and was funded by the then U.S. Office of Education to the University
of Chicago (Fuchs and Havighurst, 1972). The research was carried
out by eight universities from 1967 through 1971. It is not surprising
that Robert Havighurst was the “Principal Investigator.” He
was assisted by a co-Principal Investigator, Estelle Fuchs, a cultural
anthropologist, from Bank Street College. The committee which developed
the content of the research made a special recommendation that
no new testing be undertaken and none was undertaken. An important
departure of the research pertained to American Indians included
in the study. They did use the BIA blood quantum definition and
therefore included self-reported Indians.

Figure 4 Fuchs and Havighurst Test Data |
Fuchs and Havighurst did discuss achievement as reported in extant
evaluations (Ibid. Chap. V). The Coombs evaluation was discussed
as the most extensive ever undertaken. The Bass evaluation report
was not available for the NSIE, but the Coleman (1966) study was.
The NSIE compared several testing reports and concluded that the
data, “. . . place Indian pupils just below the national
average during the first four school years, and then they drop
substantially,” (p. 123) There was an exception to this pattern
which was reported by John Bryde (1970). Bryde reported that Sioux children made at or above grade level
on standardized tests up and until grade six. After grade six they
fell below gradually until they were at least two years behind
in grade twelve. Bryde, and others, (Coombs) called this phenomenon
the, “Cross-over Effect.” Because the cross-over effect
was very different from previous evaluation data, Fuchs and Havighurst
went to the Bryde original data for analysis. Bryde had test scores
on Sioux children in grades 7, 6, 5 and 4. He had 147 eighth grade
children but only 65 of them were in the database in grade four.
Fuchs and Havighurst thought the data reduced confidence in the
quality the findings and the cross-over effect phenomenon. They
thought Bryde would have been better off to base his study on the
65 children who were in his database from grade four through grade
eight. The cross-over effect was, therefore, seriously questioned.
Coombs also reported the cross-over effect happening at grade four.
It should be noted that the cross-over effect is based on test
norms “Grade Equivalent” scores, not raw scores.
Fuchs and Havighurst illustrated the major testing studies as
presented in Figure 4. Of the five evaluations included in Figure
4, only the Rock Point and Rough Rock data is definitely based
on an English as a second language Navajo school population.
Fuchs, a cultural anthropologist from Bank Street College, obviously,
included a discussion of “Indian culture,” that she
suggests has over time evolved and has the following characteristics: “.
. . close family solidarity and cooperation with mutual support
among the kin-folk, cooperation rather than competition among members
of a given age group, belief in the values of a tribal tradition,
belief in tribal religion, and a tribal language.” (p. 129)
Educators undertaking U.S. national evaluations of American Indian
education could profitably employ this definition for an “Indian” culture.
All other research recognizes that culture is tribal based and
different one from the other.
Cross-Over Effect. The cross-over effect wherein standardized
test scores based on American Indian students score higher and
in some studies, above grade norms, and then begin a gradual reduction
in normed test scores until at grade twelve they are two levels
below norms, has been much discussed. Regarding achievement tests,
Indian children do produce higher test scores below grade four,
but not always at test score grade level. Hopkins (2002, Op.cit.)
has performed correlation’s on two Navajo schools which includes
test scores and teacher grades. He found no correlation’s
between test scores and teacher grades below grade four but strong
correlation’s starting at grade five and continuing through
high school. He suggests that primary teachers (he has been a primary
level classroom teacher) are more interested in child development
than academic achievement. Perhaps, also, the lack of focus on
academic achievement at the primary level is a good approach to
academic achievement.
BIA, 1970 – 1979
BIA Evaluations. From 1964 through 1979 the author was intimately
involved with the BIA education evaluation function at the Washington
level of the Education Program. First I was an “Education
Specialist, Secondary,” and served as a team member at boarding
school evaluation. Starting in 1968 I was Chief of the Division
of Evaluation and Development and starting in 1971 Chief of the
Division of Evaluation, Research and Development. Starting in 1958
as an administrator and continuing through to the time until he
retired, and afterwards, I worked for and with Madison Coombs.
I was thoroughly familiar with the use of standardized test to
evaluate the BIA’s Education Program and saw no need to use
them during the 1970’s. My approach to evaluation was to
use a program review format, which could or could not include standardized
test scores. Evaluations conducted were published as they were
completed. Most of all, evaluations were to be professionally conducted
with fairness and openness to all. Cultural fairness seemed most
important evaluation characteristics of all.
BIA Publications. The BIA created two important publications
that made available evaluations of all types. One was the “Research
and Evaluation Report Series,” and the other was the “BIA
Education Research Bulletin.” Both of these publications
made available on a regular basis a large number of evaluation
and research activities. It is important to note that Deyhle and
Swisher (1979) did not include any of these publications in their
review of educational research on American Indians. Dr. William
J. Benham (Creek) was head of the BIA Indian Education Resource
Center (IERC) that created the publications and Dr. Eugene Leitka
(Choctaw) was editor of the Research Bulletin. Several of the contributors
were American Indians.
The Research Bulletin was published quarterly from 1972 through
1979. A sample of the articles reflects concern for the evaluation
of effectiveness of most BIA Education programs. For example, Bruce
Pray (1975) addressed the use of standardized tests with Indian
high school students; Annabelle Scoon-Rosenbluth (1975) researched
the feasibility of test translation from English to Navajo and
Ethel Capps reported on her masters thesis on reading levels for
Navajo high school graduates (1977). Capps, an experienced reading
teacher, expressed the belief that timed tests were a determent
to test scores and if Navajos had more time they would make higher
scores. More importantly, Navajos didn’t need high school
grade level test scores in order to do well in school. The Bass
evaluation was reported, too. James C. Martin, an American Indian
with a Ph.D., contributed important articles on the Indian self
identity and was co-principal investigator of the Oklahoma Indian
Education Needs Assessment, which is the only quality statewide
needs assessment ever conducted (1977; 1979 and USBIA No. 43).
In summary, the total of 89 articles printed in the BIA Education
Research Bulletin provide an excellent summary of what the agency
thought important regarding evaluation and research of American
Indian education during the 1970’s.
The “Research and Evaluation Report Series,” also
reflected what the BIA thought important in evaluating American
Indian education. For example, Report No. 03 is titled, “Rough
Rock School Evaluation.” (USBIA, 1979, No. 01)3 Rough Rock
was the first Indian controlled school and the IERC was requested
by the school principal to conduct the evaluation. Then, there
was the “Oklahoma Indian Education Needs Assessment, “Results
of the Test of Proficiency in English as a Second Language in Grades
4, 5, and 6 of BIA Schools, School year 1971-72, “(USBIA,
1979, No. 13) and “Evaluation Report of the Indian Education
Administrator Training Program At the Universities of Harvard,
Penn State and Minnesota, Descriptive Analysis, 1975.” Almost
every BIA Education program initiative is covered in the Series.
There were nine individual schools evaluated.
The two publications also reflect an increasing BIA education
evaluation and research sophistication. Based on contemporary discussions
in the American Education Research Association’s (AERA) Educational
Researcher (Anderson 2002; Haywood Metz and Page, 2002) discussions
of “Practitioner Research,” most of the BIA publications
would fall within this definition. This is not to say that BIA
professional educators anticipated the 2002 AERA discussions for
such was clearly not in the works in the 1970’s. What it
means is that most all the doctoral level BIA employees earned
their degrees while working closely and in some capacity, within
and outside of BIA, in American Indian education. The BIA Education
Research Bulletin provided immediate and welcoming publication
of the research. 4

Figure 5 Standardized Test Scores and Teacher Grades |
While there were no national BIA or other evaluations of American
Indian education in the 1970’s in the style of the four previously
discussed, there were national evaluations of the BIA Higher Education
(USBIA, 1979, No. 20) program and an extensive survey of 17 off-reservation
boarding schools (Sahmaunt, 1979). This evaluation was initiated
by the Deputy Assistant Secretary for Indian Affairs who wanted
to close the schools. School Administrators thought differently
and believed that describing the characteristics of the students
would justify not only keeping the schools open but increasing
their budgets to serve the extraordinary needs of the students.
A data form was developed, professional educators were sent to
the schools, and data was taken from the students “Permanent
Folders.” The evaluation had over 7,000 students in it. The
data was massive in scope and included standardized test data and
an estimated “Academic Achievement” based on teacher
grades. The test data was not of good quality but achievement based
on teacher grades was and reflected something close to a normal
distribution. Figure 5 illustrates the historical standardized
test scores and teacher grades. It is interesting that both standardized
test scores based on test norms and teacher grades will reflect
this pattern Nationally, regardless of the tribal culture of the
students. Given the bias of normed tests, it is suggested that
teacher grades might be used in juxtaposition to test norms. Policy, Indian Self-Determination Act.
The Indian Self-Determination policy was initiated by President
Johnson and formalized by President Nixon. Basically, the policy
and statute provide authority for an Indian tribal government receiving
services from the BIA to assume control of a program and administer
it to tribal members. The policy clearly provides legal authority
for tribes to assume control of and administer a BIA program. Pertaining
to education, it is reflected in tribal control of BIA funded schools,
the first of which was Rough Rock Demonstration School located
on the Navajo Nation (Johnson, 1968; Op.cit. USBIA, 1979, No. 01).
Currently, the BIA reports that 120, 65 percent, of its total of
185 funded schools, are under Indian control (USBIA, 2002).
Bilingual Education
Bilingual education which conducted instruction in two languages,
Indian and English, was initiated in BIA funded schools. The first
was Rough Rock, a Navajo schools. At about the same time, 1970,
two language schooling was also initiated by the BIA in Alaska
with the Ypik dialect of Eskimo. (Op. Cit. Hopkins, 2002). Evaluating
American Indian bilingual education presented interesting evaluation
challenges. At the time of implementing them, there was an anti-bilingual
position which expressed the belief that teaching in English, only,
was the most effective, and patriotic approach to educating any
one, including American Indians. Unfortunately, this approach is
still with us in the 21st Century and has many adherents among
Indian populations.
In almost all programs, evaluations of American Indian bilingual
education used standardized achievement tests in some manner. Most
often, standardized tests were used to prove that bilingual education
was as effective as English, only (Rosier and Holm, 1980; Rosier
and Ferella, 1976). The Navajo Rock Point School also used the
Test of Proficiency in English as a Second Language (TOPESL) which
was developed by Eugene Briere at the University of Southern California
under contract from the BIA ( Briere, 1969, 1970; Op.cit. USBIA
1979, No. 13). The Window Rock (Navajo) public school district
in Arizona, developed and made available “The Window Rock
Oral Language Test (WROL, 1981) which was for the primary grades,
but it has been used at upper levels, too. The WROL has been used
extensively with Navajo bilingual education programs to assess
Navajo language education at the primary level.
The 1980’s
The 1980’s were very active regarding the evaluation of
Indian education. However, the reports will not published for public
or government consumption. The House Appropriations Committee was
concerned about the quality of academic education provided by “Contract
School,” which were referred to as ‘Indian Controlled
Schools.” These schools had been Federal under the BIA and
with the passage of P.L. 93-638 the BIA was provided authority
to contract its programs or parts of programs to Indian tribes
so they could administer them. Also, the House Education and Labor
Committee had created a sub-committee which was very active in
developing legislation regarding Indian education. Reports pertaining
to the various Congressional committees can be found in memos and
committee reports.
Navajo Indian Controlled Schools Position Paper (1987). This
position paper was never published yet it contained excellent documentation
of the pressures the House Appropriations Committee was putting
on the Indian controlled schools. This Committee was using achievement
tests to measure the education programs of the schools. This same
design is being used today, 2002, in the three states (Arizona,
New Mexico and Utah) which have education reform achievement test
assessment reform programs. An interesting document attached to
the Navajo Position Paper narrative were memos which established
an official “Achievement Test Policy.” A memo by Dr.
Kenneth Ross clearly references congressional committees as the
motivation for the BIA Achievement Test policy. The culmination
of this activity was a January 23, 1987 memo from the “Deputy
to the Assistant Secretary/Director – Indian Affairs (Indian
Education Programs.) The first paragraph of the memo stated:
All Bureau of Indian Affairs funded schools, including contract
schools, shall administer either the CTBS-U/V or the CAT-E/F. The
Agency Superintendents for Education/Area Education Programs Administrators
will schedule and direct this for the schools under their jurisdiction.
This is a long and detailed memo which assigns the Office of
Indian Education Programs (OIEP) responsibility for “collection,
aggregation, and dissemination of Bureau-wide test information.” After
researching to locate documentation related to this task it is
highly probable that it was never performed by the OIEP.
At that time, 1987, the BIA Navajo Eastern Agency implemented
a program to respond to the House Appropriations Committee’s
pressure. Curiously enough, the Eastern Navajo Agency assessment
program, which cast achievement tests as its foundation, was presented
as a total BIA Education assessment program, which it was not.
The Eastern Navajo Agency used achievement tests (CTBS) to retain
students; evaluate teacher quality and assess their academic program.
In response to this program, Navajo contract schools hired Jim
C. Fortune (1987) to evaluate the Eastern Navajo Agency assessment
program. Fortune established a control group from a North Carolina
public schools that had 12 percent Indians in its membership. Fortune
used a research design and was very careful about data quality
and statistical procedures. Fortune used “t” tests
of significant differences, means, standard deviations and a two
way ANOVA. He used the normal curve equivalent (NCE) rather than
the test grade equivalent because the NCS is more stable. His “Purpose
of Study” could have been instructive to the Congress and
the BIA. It described how most achievement tests are constructed:
Several federal programs such as Chapter I require the use of
school achievement data in the evaluation of local projects. Such
data are also used in monitoring school progress and in the comparison
of schools governed by differing auspices. Due to the nature of
the achievement test forming process, the sole use of test scores
for student placement decisions may result in the appearance of
gains in school progress that are merely artifacts of test interpretation.
Because most developed through the sampling of global knowledge
domains so as to accommodate a wide range of school curriculum
and are leveled for multiple grades, norm statistics rather than
content differ for most sequential grades. For instance, if a fifth
grade student makes a scale score for the total battery of 696
in a spring administration of the CTBS, the student receives a
normal curve equivalent score (NCE) of 50 and a grade equivalent
score (GES) of 5.9. If the student is in the fourth grade, however,
with the same scale score of 696 on the same test; the students
NCE is 61 and his GES is 6.0. Notice that without any change in
the amount of knowledge displayed there is a large change in NCE
and a minor change in GES. This discrepancy of NCE scores represents
a caution that must be taken in school evaluations. (p.2)
Fortune found that about all of Eastern Navajos claims, i.e.
that retaining students was a motivator for increased achievement,
were fallacious. The Navajo Position paper also discovered that
Eastern Navajo’s outstanding teacher handed out to students
actual pages of the tests and taught the children correct answers
just prior to administration of the CTBS. It is interesting that
neither the Congress or the BIA bothered to investigate their rich
historical heritage of the use of standardized tests and their
basic cultural unfairness. Ignorance ruled the day, as it does
today.
Evaluating Indian Controlled Schools. It was mentioned above
that the House Appropriations Committee was concerned about the
quality of Indian controlled schools’ education. The Committee
financed an education evaluation contract to Abt Associated, Inc.
(1985) to assess the quality of the education program in Indian
controlled schools. The evaluation design was elaborate and considerable
effort was made to be fair to the schools. For purposes of comparisons,
BIA and public schools were included. The evaluation included grades
eight and twelve. The total database included School Characteristics,
Student Background Characteristics, Achievement Tests, Attendance,
Costs and Department of Education Title IV Enrichment Projects.
At the eighth grade level the evaluation included 20 Indian controlled
schools, two BIA schools and 10 public schools for a total of 581
students. At grade twelve they included 26 Indian controlled schools,
3 BIA Schools and seven public schools and 455 students. Several
statistical procedures were applied to the data. The report summary
stated:
In this report, we have provided comparative data between Indian
controlled schools and other schools in the same regions serving
Indian students with backgrounds that were as similar as possible,
in order to provide a fair comparison. Objectively, then, we can
say that Indian controlled schools are higher in cost than other
schools, on average; that eighth grade students in Indian controlled
schools are not achieving as well as students in the comparison
schools, on average; that twelfth grade students in Indian controlled
schools are achieving better than twelfth graders in BIA schools
but not as well as twelfth graders in Public schools, on average;
the attendance levels are slightly lower for students in Indian
controlled schools than for students in other schools, on average;
and that Indian controlled schools exhibit higher rates of withdrawal
from twelfth grade than other schools, on average. Whether these
results indicate serious problems in the Indian controlled schools
is open to interpretation. (p. 114)
The elaborate design notwithstanding, the evaluation did not
prove the inferiority of Indian controlled schools.
In all respects, it was a fair attempt to respond to the needs
of the house Appropriations Committee. Here, again, we find a concern
for the education quality differences between school types. The
design matched student types, similar to the design of the Bass
evaluation, and concluded with only some differences. When students
are matched according to background characteristics, few differences
can be expected between school types.
The same as with the National Study of American Indian Education
conducted by Havighurst and Fuchs, this evaluation was also important
because the contract was managed by the U.S. Department of Education,
Office of Indian Education, not the BIA. It should be noted from
above that one of the stipulations was to provide evaluative information
pertaining to Title IV, Enrichment Projects, which were funded
by the Office of Indian Education. This is a somewhat unusual situation
because a Department other than Interior was funded to evaluate
an Interior Department program. This is not often done in U.S.
Government Education activities.
Gifted Attitude Inventory for Navajos. Up to 1980 there had been
very few instruments that tried to provide an education dimension
which was based on an Indian culture. The Din? Bi?Olta? Research
Institute, a Navajo organization headed by Dillon Platero, obtained
a grant from the U.S. Department of Education, Office of Indian
Education to develop the “Gifted Attitudes inventory for
Navajos (GAIN) (Abbott, about 1982). GAIN was piloted on 101 Navajo
high school students and statistics were developed to reflect this
exercise.. GAIN was designed for “adolescents in grades nine
through twelve, or between the ages of approximately thirteen through
nineteen.” The inventory consisted of 66 items for which
there were no right or wrong answers. Each school was encouraged
to develop their own statistical profile. GAIN was used at the
Navajo Academy in 1986 (Platero, 1986) to help identify gifted
and talented students. The report is critical of GAIN as it seemed
to over emphasize moral turpitude. The statistical output did approach
a normal distribution starting at the 21st percentile and going
upward. GAIN has eight sub-categories:
1. Planning
2. Self-Reliance
3. Self Determination
4. Respectfulness
5. Ethics
6. Presentment
7. Attentiveness
8. Motivation
The higher the GAIN raw score, the higher the student reflects
Navajo intelligence and culture. GAIN could probably be used today
as one of the background characteristics to assist in matching
Navajo students and in weighting standardized test scores. It would
be interesting to run correlation’s and regressions which
included GAIN scores.
While working at the Navajo Academy, Hopkins (1986) administered
GAIN to 170 Navajo Students in the spring of 1986. He created a
comprehensive database which included all the GAIN data as well
as CTBS scores in Reading, Language Arts, Math and the Ottis-Lennon
School Abilities I.Q. test. He ran correlation’s on the CTBS,
Lennon and GAIN data and found a weak relationship between the
standardized instruments and GAIN, His correlation matrix follows.
Correlation Matrix
|
READ
|
ENG
|
MATH |
OTIS |
GAINAVG |
READ |
1.000000 |
|
|
|
|
ENG |
.723957 |
1.000000 |
|
|
|
MATH |
.620252 |
.721980 |
1.000000 |
|
|
OTIS |
.713420 |
.620930 |
.484501 |
1.000000 |
|
GAINAVG |
.121127 |
.161980 |
-.026263 |
.182804 |
1.000000
|
PERCTLE |
.120078 |
.155941 |
-.038332 |
.178985 |
.981746 |
|
PERCTLE |
|
|
|
|
PERCTLE |
1.000000 |
|
|
|
|
READ = CTBS Reading sub-test, administered in fall of 1985
ENG = CTBS Language Arts sub-test, administered in fall of 1985
MATH = CTBS Math sub-test, administered in fall of 1985
OTIS = Otis-Lennon Scholastic Ability IQ test
GAINAVG = GAIN mean of percentage scores on each section
PERCTLE = GAIN percentile score for an individual
GAIN was administered April 11, 1986 to 186 students at Navajo
Academy. 170 were determined useful for analysis and reporting.
The database contains all the GAIN scores, including raw scores.

Figure 6 McGraw Hill 1985 Report |
BIA Standardized Testing Program, 1980-1987.
Though not published, the 1980’s were busy. The BIA contracted
with McGraw Hill to test some of its students. McGraw Hill provided
BIA a report
each year. The 1985 report seemed to be available more than other
years (USBIA, 1985; Hopkins, 1987). Only an unspecified sample
of students were tested in the spring of the school year. 15,097
took the CTBS and 5,539 took the CAT for a total of 20,636, which
would be less than half the BIA total number of students. It
can only be assumed that students were not tracked and cohorts
were
used for statistical purposes. When test norms were used, the
same pattern established by the University of Chicago is repeated
i.e.
growth is good in lower grades and falls behind until grade 12
Indian students are two test grade levels behind. The report
was prepared by CTBS McGraw Hill. They issued a word of caution
about
the CTBS being harder than the California Achievement Test (CAT).
It seems that the scores were going up on the CTBS so they made
it harder. This means that the bell curve dominated testing rather
than student learning. The CAT, so they said, was easier than
CTBS as it had not been changed. This report clearly reflects that
the
priority of the education psychologists at McGraw Hill placed
a higher priority on statistical output than on student learning.
When combined with an earlier McGraw Hill specialist who recommended
that under no circumstances should the time to take a sub-test
be changed it is clear that standardized tests and their use
of
the years with Indian students are fundamentally flawed. The 1990’s
Deyhle and Swisher (Op.cit. 1997) made the Self-Determination
a major theme in their review of research literature. Their understanding
of the education situation was that schools taught assimilation
to Indian children. There is no doubt that individual non-Indian
teachers have, contrary to program policy, have taught in such
a manner to reflect assimilation. There is also little doubt that
most textbooks used in Indian classrooms were and are Eurocentric
based. Also, with the vast majority of American Indians now being
educated in public schools, Indian Self-Determination does not
apply. Public schools, alone, may represent a tacit program of
assimilation through education. Interestingly enough, the 120 BIA
funded contract and grant schools may represent the only non-assimilation
schools in the U.S. and they are but a very small proportion of
the total. Regarding assimilation as an education policy, officially,
it has not been a BIA policy since the 1930’s. Nor has it
been the policy of the U.S. Department of Education’s Office
of Indian Education. Nonetheless, with few Indian teachers, it
may be a major purpose of teachers.
Indian Professional Education Researchers. There are now hundreds,
perhaps thousands of professional educators who are Indian. Compared
to 1940 when the entire Peterson evaluation took place without
a single Indian in a decision making position, there has been a
sea change in the education of Indians. We cited above an evaluation
report reflecting the BIA’s paying for the higher education
of educational administrators, which was in 1975. Many of the individuals
who were students in these programs eventually occupied leadership
positions in government, public education and higher education.
It is important that all administrative positions in the BIA’s
Office of Indian Education Programs are Indian.
Indian Teachers. Starting in 1992, the Navajo Nation initiated
a Teacher Education Program (Hopkins, 1998). It started as a part
of the Ford Foundation Minority Teacher Education Program but is
now entirely supported by the Navajo Nation. Since 1992, the program
has produced over 600 elementary, Navajo, certified teachers. Gorman
(2000) reported that over half the elementary teachers in Navajo
Nation schools are now Navajo. Also, there are now a few schools,
BIA funded, only, with a total Navajo staff, including administrators,
teachers and ancillary staff. The Teacher Education program is
still in effect and is gradually producing new certified teachers.
Contemporary BIA Education Evaluation. The BIA has implemented
an education evaluation program for its 185 funded schools. Because
of the complexity of the local situations, no single standardized
test is used. Indeed, no test is mandated. Schools are required
to set targets in the curriculum areas of math and language arts.
In all areas included in the evaluation design, assessments are
divided into three proficiency categories: (1) Advanced, (2) Partially
Proficient. (USBIA, 2002) All reports are school generated with
no outside evaluation assistance. There are individual school “Report
Cards,” which are posted on the BIA Office of Indian Education
Programs web site. It is important to observe the non-test items
included in the BIA assessment program:
EXECUTIVE SUMMARY
The Office of Indian Education Programs (OIEP) functions as a
State Education Agency (SEA) and thus required to issue a report
for each of its schools summarizing records of performance with
the implementation of a comprehensive school reform program; satisfying
our customers; and, more importantly, improving student achievement.
The following information is based on the data reported in the
individual school report cards for SY 2000-01.
During the 2000-2001 school year, the schools reported that:
The Bureau-funded school consisted of 185 schools/dormitories;
.with 65 schools or 35% operated by the Bureau and 120 schools
or 65% operated by the Tribes through a PL 93-638 contract and/or
PL 100-297 Grant, in 23 states.
- The Student enrollments were 48,693, a
decrease of 383 students from SY 99-00.
- Ninety-six percent of the
schools were accredited through the state and/or regional accreditation
association.
- The average daily attendance rate was 90%.
- The yearly enrollment
retention rate was 94%.
- The yearly dropout rate was 11%, for
an increase of 1% from SY 99-00.
- The student math achievement
levels were at 50% Partially Proficient, 44% Proficient and
6% Advanced, for a
consistent result
in student math achievement at the proficiency and advanced levels.
- The student language arts achievement levels were at 50% Partially
Proficient, 45% Proficient and 5% Advanced,
for a
2% increase in student language arts achievement levels at the
Proficient and Advanced levels.
- The proficiency levels of staff
in their use of technology was 26% Partially Proficient, 49%
Proficient and
25% Advanced,
for a 7% increase in the Proficient and Advanced levels.
- The
staff proficiency levels in use of new assessments were 31%
Partially Proficient, 46% Proficient and 23%
Advanced,
an increase of 1% in the Proficient and Advanced levels.
- There
were 3,496 substance abuse incidents for a decrease of 1,035
of substance abuse incidents.
- There were 8,471 violence incidents
for a decrease of 2,235 incidents in violence.
The current BIA education assessment program reflects a 30 year
policy of down playing the use of standardized tests and their
norms for evaluation purposes. This may be changing with the national
state public schools accountability programs and with the national
No Child Left Behind Act (NCLBA). Interestingly enough, the Department
of Education’s OIE does not have an evaluation program.
Navajo Nation Rural Systemic Initiative. This program is funded
by the National Science Foundation (NSF) to the Navajo Nation.
It started in October of 1998 and will conclude in March of 2004.
NSF requires an outside evaluation. Patrick Galvin of the University
of Utah is the Principal Investigator of the evaluation. Galvin,
et. al (2001) using Arizona, Utah and New Mexico test data, provides
analyses that are related to control characteristics such as socioeconomic
status (SES) and Navajo culture. The focus is on math and science
test scores. NSF will accept qualitative evaluations but prefers
quantified evaluations. State data is very important to Navajo
education as public schools provide approximately 75 percent of
schooling for Navajos. All three states have high stakes testing
assessment-accountability programs. None of the states identify
American Indian students by their culture name, which complicates
separating Navajo data from the others. Galvin provides two basic
questions which the evaluation attempts to answer:
- What is the math/science test score achievement gap between
Native American and Anglo students?
- What evidence is there that
the test scores of students attending schools supported by
NN-RSI programs increase at a faster rate
overtime than a comparable group of students not supported by
these initiatives?
In the Executive Summary to the report, Galvin states:
The content of this evaluation report describes in detail the
math/science achievement gap of public school Native American children.
The average math or science scores for Native American students
is about 20 points below the averages for Anglo students. However,
after controlling for English proficiency and socio-economic status
the achievement gap is not large (5 or 6 points). These differences
are still statistically significant and need to be explained.
The evidence of achievement growth in math and science overtime
is not easily summarized. Generally, over the last three or four
years (depending on the data set), the growth of math achievement
for Anglo students has outpaced that for Native American students.
However, in schools and school districts supporting the initiatives
promoted by the NN-RSI the achievement growth of Native American
students has outstripped that of their Anglo counterparts. One
way to summarize these points is to say that the evidence supports
claims that Native American students can do well on standardized
achievement test that curricular and instructional support these
students are making consist and significant achievement gains overtime.
This evaluation uses extant data and does not gather test data
for the stated project purposes. Galvin uses background characteristics
(language and socio-economic) in analyses. This evaluation practice,
we have learned, always produces data different from that which
does not control background characteristics, i.e. the BIA testing
program of the 1980’s. There is always more confidence in
the evaluation when background characteristics are included in
test data analysis. Overall, NN-RSI has generated over 10 evaluation
reports, but Galvin’s addresses the achievement gap in such
a manner as to relate to contemporary Navajo (American Indian)
education.
Navajo Culture and Attitude Instrument. A second Navajo culture
instrument was developed by Kalvin White (1998) for his dissertation
at the University of Utah. This instrument has well developed statistics
and White is a Navajo. Perhaps White’s instrument and GAIN
will in time help provide an important Navajo culture dimension
to Navajo education evaluation.
No Child Left Behind Act. The NCLBA has the potential for reversing
policy regarding Indian Self-Determination in education. The BIA
is considered the 51st state and has developed a state plan for
implementing NCLBA (USBIA-NCLBA, 2002). The state plan discusses
the complexity of following 23 different state assessment plans
as well as others developed by tribal accrediting offices. It states:
Before the Bureau’s new assessment system is fully developed
and implemented in 2004-2005, Bureau-funded schools will be required
to continue to utilize the norm-referenced tests of their states
until their states’ new criterion-referenced assessment
systems are fully approved by the Department of Education. At
that time,
BIA-funded schools will utilize those approved state systems
in their entirety to determine proficiency levels of their students.
(p. 7)
The BIA Mission statement is worth noting, too.
The mission of the Bureau of Indian Affairs, Office of Indian
Education Programs is to provide quality education opportunities
from early childhood through life in accordance with the tribes'
needs for cultural and economic well-being and in keeping with
the wide diversity of Indian tribes and Alaska Native villages
as distinct cultural and governmental entities. The Bureau shall
manifest consideration of the whole person, taking into account
the spiritual, mental, physical and cultural aspects of the person
within family and tribal or Alaska Native village contexts.
The Mission Statement does not mention Indian Self Determination.
Tribal governments are mentioned as “Entities,” which
implies they are something other than legal governments. Without
some clear defense of Self Determination, which is supported by
|