Essay-Grading Software Offers Professors a Break - by John Markoff, New York Times, April 4, 2013

Imagine taking a college exam, and, instead of handing in a blue book and getting a grade from a professor a few weeks later, clicking the “send” button when you are done and receiving a grade back instantly, your essay scored by a software program.

And then, instead of being done with that exam, imagine that the system would immediately let you rewrite the test to try to improve your grade.

EdX, the nonprofit enterprise founded by Harvard and the Massachusetts Institute of Technology to offer courses on the Internet, has just introduced such a system and will make its automated software available free on the Web to any institution that wants to use it. The software uses artificial intelligence to grade student essays and short written answers, freeing professors for other tasks.

The new service will bring the educational consortium into a growing conflict over the role of automation in education. Although automated grading systems for multiple-choice and true-false tests are now widespread, the use of artificial intelligence technology to grade essay answers has not yet received widespread endorsement by educators and has many critics.

Anant Agarwal, an electrical engineer who is president of EdX, predicted that the instant-grading software would be a useful pedagogical tool, enabling students to take tests and write essays over and over and improve the quality of their answers. He said the technology would offer distinct advantages over the traditional classroom system, where students often wait days or weeks for grades.

“There is a huge value in learning with instant feedback,” Dr. Agarwal said. “Students are telling us they learn much better with instant feedback.”

But skeptics say the automated system is no match for live teachers. One longtime critic, Les Perelman, has drawn national attention several times for putting together nonsense essays that have fooled software grading programs into giving high marks. He has also been highly critical of studies that purport to show that the software compares well to human graders.

“My first and greatest objection to the research is that they did not have any valid statistical test comparing the software directly to human graders,” said Mr. Perelman, a retired director of writing and a current researcher at M.I.T.

He is among a group of educators who last month began circulating a petition opposing automated assessment software. The group, which calls itself Professionals Against Machine Scoring of Student Essays in High-Stakes Assessment, has collected nearly 2,000 signatures, including some from luminaries like Noam Chomsky.

“Let’s face the realities of automatic essay scoring,” the group’s statement reads in part. “Computers cannot ‘read.’ They cannot measure the essentials of effective written communication: accuracy, reasoning, adequacy of evidence, good sense, ethical stance, convincing argument, meaningful organization, clarity, and veracity, among others.”

But EdX expects its software to be adopted widely by schools and universities. EdX offers free online classes from Harvard, M.I.T. and the University of California, Berkeley; this fall, it will add classes from Wellesley, Georgetown and the University of Texas. In all, 12 universities participate in EdX, which offers certificates for course completion and has said that it plans to continue to expand next year, including adding international schools.

The EdX assessment tool requires human teachers, or graders, to first grade 100 essays or essay questions. The system then uses a variety of machine-learning techniques to train itself to be able to grade any number of essays or answers automatically and almost instantaneously.

The software will assign a grade depending on the scoring system created by the teacher, whether it is a letter grade or numerical rank. It will also provide general feedback, like telling a student whether an answer was on topic or not.

Dr. Agarwal said he believed that the software was nearing the capability of human grading.

“This is machine learning and there is a long way to go, but it’s good enough and the upside is huge,” he said. “We found that the quality of the grading is similar to the variation you find from instructor to instructor.”

EdX is not the first to use automated assessment technology, which dates to early mainframe computers in the 1960s. There is now a range of companies offering commercial programs to grade written test answers, and four states — Louisiana, North Dakota, Utah and West Virginia — are using some form of the technology in secondary schools. A fifth, Indiana, has experimented with it. In some cases the software is used as a “second reader,” to check the reliability of the human graders.

But the growing influence of the EdX consortium to set standards is likely to give the technology a boost. On Tuesday, Stanford announced that it would work with EdX to develop a joint educational system that will incorporate the automated assessment technology.

Two startups, Coursera and Udacity, recently founded by Stanford faculty members to create “massive open online courses,” or MOOCs, are also committed to automated assessment systems because of the value of instant feedback.

“It allows students to get immediate feedback on their work, so that learning turns into a game, with students naturally gravitating toward resubmitting the work until they get it right,” said Daphne Koller, a computer scientist and a founder of Coursera.

Last year the Hewlett Foundation, a grant-making organization set up by one of the Hewlett-Packard founders and his wife, sponsored two $100,000 prizes aimed at improving software that grades essays and short answers. More than 150 teams entered each category. A winner of one of the Hewlett contests, Vik Paruchuri, was hired by EdX to help design its assessment software.

“One of our focuses is to help kids learn how to think critically,” said Victor Vuchic, a program officer at the Hewlett Foundation. “It’s probably impossible to do that with multiple-choice tests. The challenge is that this requires human graders, and so they cost a lot more and they take a lot more time.”

Mark D. Shermis, a professor at the University of Akron in Ohio, supervised the Hewlett Foundation’s contest on automated essay scoring and wrote a paper about the experiment. In his view, the technology — though imperfect — has a place in educational settings.

With increasingly large classes, it is impossible for most teachers to give students meaningful feedback on writing assignments, he said. Plus, he noted, critics of the technology have tended to come from the nation’s best universities, where the level of pedagogy is much better than at most schools.

“Often they come from very prestigious institutions where, in fact, they do a much better job of providing feedback than a machine ever could,” Dr. Shermis said. “There seems to be a lack of appreciation of what is actually going on in the real world.”

DMU Timestamp: April 10, 2015 19:32

Paragraph 1 0

Apr 17

Mr. Hadley Robins (Apr 17 2015 8:10AM) : This almost sounds too good to be true!

Apr 30

Mr David Lewis (Apr 30 2015 3:04PM) : lol, at first it does, but if it really works it could change everything that we know.

May 4

Ms. JAMILAH JENKINS (May 04 2015 9:04AM) : I can't say I've ever turned in a college exam inside of a blue book. I believe I've seen such a thing in movies. I really like the automatic scoring option though!

Paragraph 2 0

Apr 21

Kevin Dawkins (Apr 21 2015 10:04PM) : So, we are allowed multiple opportunities to correct our mistakes? If so, then why study when it's ultimately fail-proof?

Apr 26

Elizabeth Hallen (Apr 26 2015 2:41PM) : Comment to Kevin Dawkins

I believe that students benefit greatly from correcting their mistakes, especially in a mathematics classroom. Students receive their initial grade on the assessment, but then are given the opportunity to work through the assessment again to see their mistakes and correct them. Some teachers allow students to get half credit back on this, while others, like me, use it as a homework grade for completion. As for essays, we are most often given the opportunity to turn in a first draft, if not a second and a third, before handing in the final draft. I see this as the same concept.

May 4

Ms. JAMILAH JENKINS (May 04 2015 9:05AM) : Whoa! An option for a second chance? It really doesn't get any better than that!

Paragraph 3, Sentence 2 0

Apr 27

Bo Park (Apr 27 2015 9:15PM) : I understand professors may have additional tasks to complete, but when it comes to writing I would want to have direct feedback from the professor.

Paragraph 4 0

Apr 24

Chintan Patel (Apr 24 2015 6:43PM) : I understand the lack of endorsement. Without the professor's mind reading the essays, how can one defend a machine's interpretation to be valid?

Paragraph 6 0

Apr 26

Kesha Mathis (Apr 26 2015 10:18PM) : Instant feedback

Why should everything be instant? What happened to having a processing stage?

Paragraph 7, Sentence 2 0

May 4

Ms. JAMILAH JENKINS (May 04 2015 9:08AM) : I have to appreciate guys like Les Perelman. The grass isn't always greener on the other side. There will always be side effects. This paragraph forces me to think of the many people who could lose their jobs to automated systems.

Paragraph 9, Sentence 1 0

Apr 19

Claire Dell (Apr 19 2015 12:15AM) : This sounds like trying to stop progress....

Paragraph 10, Sentence 3 0

Apr 26

Andrew Christmas (Apr 26 2015 3:54PM) : These are the primary purposes of essay writing. I believe computer grading will lead to more formulaic essay writing, but less actual content.

May 4

Ms. JAMILAH JENKINS (May 04 2015 9:08AM) : I have to agree with you on that one.

Apr 27

Bo Park (Apr 27 2015 9:17PM) : No matter how far we've come with artificial intelligence, computers cannot read emotions and the additional aspects of writing listed in this sentence.

May 4

Ms. JAMILAH JENKINS (May 04 2015 9:10AM) : You are so correct. There is a major problem within our society where we seem to crave instantaneous gratification. This isn't always good.

Paragraph 19 0

Apr 26

Adam Gasaway (Apr 26 2015 4:39PM) : Resubmission is not learning. [Edited]

Allowing students to resubmit work “until they get it right” is lunacy. I could understand if a student completely missed the point of an assignment or if instructions weren’t clear, or if the nature of the assignment was to elicit feedback and then revise. But, simply letting students resubmit assignments until those assignments resemble something that we think makes it look like they’ve learned something is a disaster waiting to happen.

Paragraph 23 0

Apr 26

Elizabeth Hallen (Apr 26 2015 2:36PM) : Paragraph 23

I absolutely agree that meaningful feedback is one of the main, if not the only, ways for students to grow as writers. Especially in college, there could be up to 300 students in an entry-level English course; there is no way that a teacher can leave meaningful critiques on every student’s paper, even with graduate assistants. I think this method of grading is amazing and I can’t wait to see how this impacts teachers at the high school level.

Apr 26

Adam Gasaway (Apr 26 2015 4:43PM) : Maybe instead of trying to make larger class sizes more manageable, we should focus on reducing class size in order to make learning more impactful.

General Document Comments 0

Apr 19

Claire Dell (Apr 19 2015 12:34AM) : Comment on whole article

Like so much other technology in education, this is initially being received hesitantly and with a lot of criticism. But I think it will be developed to a point where it will be a useful tool and provide valuable support to instructors and students.

Spelling and grammar checks in Microsoft Word are not always correct, but they are right often enough to be very useful and improve the standard of writing. In the same way, I think this technology may help students prior to submitting essays.

As far as replacing a human grader goes, that depends on how knowledgeable the human is! The technology may progress to a point where it does just as well as many graders, but may never replace a well-qualified English teacher.

With the increasing numbers of students and pressure on teachers, I think this sort of technology is inevitable and will form part of a system that will become predominantly automated.

Like the discussion on translating earlier in the semester, I think this technology is worth researching as it will definitely provide benefits, but it may never replace a qualified individual.

Apr 20

Brian Skrzypek (Apr 20 2015 8:58AM) : Comment on whole article

As an upcoming English teacher, I may be hesitant to implement an auto-grading essay system in my classroom. With our current technology, a machine that grades a paper automatically will have many flaws when compared to the traditional method.

One of the biggest flaws that I worry about is the lack of personality that a computer has when compared to a human. Many teachers teach what they were taught, so not every essay will be scored in the same manner. Although this sounds like a bad thing, it is actually good because the learner can learn new concepts from different perspectives. We just can’t get that from a computer programmed to grade an essay with a one-track mind.

Another problem is the inevitable use of cheating techniques. When a teacher grades an essay by hand, it is harder for a student to cheat, since the essay is being looked at directly. With an automated essay scoring system, students will eventually find out what the software looks for in an essay. If a teacher only lets the computer check the assignment, then a student can pass with an essay that he or she barely took the time to write.

Apr 21

Maggie Trimble (Apr 21 2015 10:20AM) : Essay Grading Software has a place in the classroom, but can't be given total autonomy over the essay grading process.

I understand the benefits that an automated grading system for essays could bring, such as quick feedback and a lack of bias. Despite this, having a computer grade my essay would make me uncomfortable as a student. The article admitted that this software still has a long way to go, and honestly I do not see a computer being more accurate than a trained professional who understands the context and the assignment. However, I do think that it would be beneficial to use both. The teacher could use the software to check things that the teacher could have missed or to gain a different perspective. The teacher could also let the students have one re-do by using the software, which could help the students better understand their grammar mistakes and overall writing style. So, I do see a place for this software, but I do not think the teacher should be totally left out of grading essays. -Maggie Trimble

Apr 21

Kevin Dawkins (Apr 21 2015 10:10PM) : While this technology seems like it could be a good idea, I do not see how it could be beneficial in the long run. Too many errors could take place and it would be hard to match the exact context of what the writer was trying to say.

I can see where this automatic grading could be useful when grading multiple choice and matching answers. However, I do not see the benefit of having this software grade short answers and essays.

I might be one of the skeptics, but in my mind there is no way for a machine to take words and understand the exact context in which the writer means to use them. I feel like if this software were to be used in schools, then children would eventually be graded unfairly, which would result in a whole other mess of having to deal with upset students and parents.

Apr 24

Chintan Patel (Apr 24 2015 6:50PM) : Automated grading of essays has too many red flags for my liking. At the end of the day, a machine simply cannot interpret text like a particular human.

I cannot say that I support this method of grading. I understand the want for instant feedback on a student’s part, but I can’t see how the feedback will always be relevant. You can have as many lines of code as you want to try to interpret all meanings, but if there is even one instance where the machine interprets incorrectly, the system is flawed in my opinion.

It may take longer, but when a professor grades a paper, they can be approached with questions. You can ask the professor why they thought a certain way about your answer, or back your answer up with evidence. You don’t have that option with a machine grading an essay.

The fact is that when it comes to essays and the written word, the grading should come from a qualified teacher who actually taught you the material. The written word should be analyzed by someone who can feel emotion, or simply interpret on multiple levels. The concept of the software is nice, but I think instant grades should continue to apply only to multiple-choice or true-or-false questions. Anything that can be interpreted in multiple ways should be graded by a human and not a machine that supposedly thinks like said human.

Apr 26

Andrew Christmas (Apr 26 2015 4:01PM) : What is the real point of writing essays?

If the purpose is merely to give grades, then there are obvious advantages. Unfortunately, I think there are some serious problems with the way we expect students to write essays over and over when 95% of them are just for a grade with no real audience. Academic writing has become so formulaic that it is not particularly hard to put together all the “required” elements to get a good grade without really putting any original ideas on paper. And really there is little incentive to come up with an original argument if your audience is just a computer.

Apr 26

Math teacher Susan Protzman (Apr 26 2015 9:13PM) : I have a difficult time seeing exactly how this would be implemented in math content; I guess you could automate some basic calculations, but I'm not sure about others. However, for writing, I think this is fabulous. [Edited]

One question arises, though: teachers instruct on writing in different manners, but the ‘grader’ (I suppose) would grade the same way each time. So, is there a way that the teacher can alter some of the ‘grading’, or is there a ‘rubric’, so to speak, against which the teacher instructs?

I say this only because I see frustrated students: they learn to write from teacher ‘A’, then the following year teacher ‘B’ focuses on other elements of writing, and they feel they are starting all over again. With this kind of tool, it seems that it’s ‘one direction’ or ‘method’.

Apr 26

Kesha Mathis (Apr 26 2015 9:58PM) : Comment on the whole article [Edited]

Imagine a time when grading papers becomes less laborious for professors and instructors, feedback becomes more instantaneous, and student writers are more encouraged to improve their writing, be it essays, essay responses or other types of writing. Well, thanks to EdX, the enterprise founded by Harvard and the Massachusetts Institute of Technology, this idea is closer to being realized. EdX has created software that can instantaneously review students’ written assignments and respond with results. The software is showing promise.

This article discusses many of the technical details of what the software is designed to do, as well as the debate over its viability. Some parties oppose the software and others support it. Much of the opposition comes from very prestigious universities, including M.I.T., where the critic Les Perelman is a researcher. All parties agree that the software is imperfect and needs work, as similar software has been fooled by deliberately bogus essays. However, the reasons for opposition and support vary widely. Opponents believe that such software does not and will not match the effectiveness and accuracy of a human grader. Supporters believe that the software can be trained and improved upon, that it relieves the instructor’s load, allowing them to focus on other tasks, and that it encourages students to keep resubmitting their work to get better grades. The software can be set up according to the instructor’s requirements to assess the writing.

In my opinion, the software, while not perfect, does show promise in that it can be improved, much as GPS can now show routes with the least traffic or without tolls, or ATMs can now count deposited checks and cash.

I believe the software will help individuals learn to write better because they receive instant feedback instead of having to wait for the teacher to grade, by which time they have to remember what they wrote, which adds to the time it takes to learn and grow in writing skills. I think this software can also grow to be better than human graders. Even teachers forget lessons and concepts about great writing, but computers don’t. Again, while not perfect, this is viable because it does have concrete promise.

Apr 30

Mr David Lewis (Apr 30 2015 3:03PM) : This technology has the ability to drive the future direction of pedagogy and education

This technology has an amazing upside. With anything new there will be skeptics, but the positive aspects outweigh the negative implications. More and more people are going to college, and online colleges are expanding. The number of people attending college has increased dramatically, and oftentimes students have to wait weeks to receive grades on essays. This affects instruction, as teachers are forced to give multiple-choice or short-answer questions instead. It also affects the way that professors evaluate students’ understanding of the content, as it is difficult for them to assess true mastery through multiple-choice questions and short answers. This technology would dramatically improve pedagogy in the classroom and increase effective learning that would drive instruction. There are some drawbacks to using an automated system, as some papers may be graded incorrectly, but the positive implications of this new technology outweigh these possible negatives, and it has the ability to take education in a new direction.
