TeachingCase TeachingStructuredAuthoringandDITAThroughRhetorical andComputationalThinking ·...

16
328 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015 Teaching Case Teaching Structured Authoring and DITA Through Rhetorical and Computational Thinking —CARLOS EVIA,MATTHEW R. SHARP, AND MANUEL A. PÉREZ-QUIÑONES Abstract—Background: The diffusion of component content management and structured authoring workows and technologies in technical communication requires that instructors of documentation courses determine effective ways to teach component content management to students who may initially be intimidated by authoring environments and structures, such as the Darwin Information Typing Architecture (DITA). This teaching case describes how component content management and DITA were integrated into the Creating User Documentation course of an undergraduate professional writing program. Research questions: How can instructors of technical and professional writing best teach English and humanities students to operate within a structured authoring workow? How can computational abstraction be combined with students' previously acquired genre knowledge to ease their adoption of the DITA to create technical documentation? Situating the case: The development of this course was informed by literature from a variety of scholarly and industry sources, which reveal connections between DITA, computational thinking, and Rhetorical Genre theory. Specically, the concept of “layers of abstraction” guides the development of the course's structure, allowing students to separate and independently process the various aspects of a structured authoring workow. How the case was studied: The case was studied informally through the experience of the authors as they developed and taught the course, through informal discussions and structured interviews with industry professionals, and through student reections from discussion forum posts from Fall 2012 through Fall 2013. About the case: Initially developed with a focus on print manuals and online help, the course began teaching topic-based authoring in the mid-2000s; however, most enterprise-level editors and tools were cost-prohibitive for students and faculty. Furthermore, many computing concepts associated with structured authoring were intimidating for an audience of students in an English department. An affordable solution was adopting the open-source DITA standard, using free trials or open-source editors. The intimidation factor was minimized by designing the course around ve layers of abstraction that draw on students' previous rhetorical knowledge: Layer 1: Developing quality documentation, Layer 2: Separating content from design, Layer 3: Authoring granular content with XML, Layer 4: Authoring and linking Component Content Management modules with DITA, and Layer 5: Single-sourcing and content reuse. This case discusses each layer of abstraction, the associated assignments for each layer, and the results of each layer based on student feedback. Results and conclusions: Although the course is not universally loved by students, it has seen many successes and provides a much-needed foundation in component content management and structured authoring for students who might become technical communicators. The teaching team has learned to avoid overemphasizing coding and automation in structured authoring, maintain a solid grounding on writing principles and good technical communication requirements, and draw upon students' existing knowledge of genres and their constraints. Index Terms— Abstractions, computer languages, DITA, documentation, rhetoric, standards, writing, XML. In “Component Content Management: Shaping the Discourse through Innovation Diffusion Research and Reciprocity,” Andersen [1] makes a call to combine efforts from the academia and industry sides of technical communication to create awareness about the importance of component content-management diffusion. Structured Manuscript received May 26, 2014; revised October 12, 2014; accepted December 16, 2014. Date of current version January 29, 2016. C. Evia is with the Department of English and Center for Human–Computer Interaction, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA (email: [email protected]). M. R. Sharp is with the Humanities and Communication Department, Embry-Riddle Aeronautical University, Daytona Beach, FL 32114 USA (email: [email protected]). M. A. Pérez-Quiñones is with the Department of Software and Information Systems, University of North Carolina at Charlotte, Charlotte, NC 28223 USA (email: [email protected]). IEEE 10.1109/TPC.2016.2516639 authoring is a key component of an effective component content-management environment, and while many technical communicators still work for companies that use unstructured content [2], adoption of a structured authoring workow is increasing [3]. In addition, reports on industry practices [3] and skills required in job postings [2] frequently cite the Darwin Information Typing Architecture (DITA) as the most popular content structure in technical communication. This steady increase in demand for the skills of structured authoring is not the only reason to teach about DITA in technical communication courses, however. McShane claims that embracing technologies, such as XML, offer “us—and our students—an opportunity to employ the full-range of our rhetorical skills” [4, p. 74]. In other words, it allows our students to critically engage with a new technology, providing them additional 0361-1434 © 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Transcript of TeachingCase TeachingStructuredAuthoringandDITAThroughRhetorical andComputationalThinking ·...

328 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

Teaching Case

Teaching Structured Authoring and DITA Through Rhetoricaland Computational Thinking—CARLOS EVIA, MATTHEW R. SHARP, AND MANUEL A. PÉREZ-QUIÑONES

Abstract—Background: The diffusion of component content management and structured authoring workflows andtechnologies in technical communication requires that instructors of documentation courses determine effective waysto teach component content management to students who may initially be intimidated by authoring environments andstructures, such as the Darwin Information Typing Architecture (DITA). This teaching case describes how componentcontent management and DITA were integrated into the Creating User Documentation course of an undergraduateprofessional writing program. Research questions: How can instructors of technical and professional writing bestteach English and humanities students to operate within a structured authoring workflow? How can computationalabstraction be combined with students' previously acquired genre knowledge to ease their adoption of the DITA tocreate technical documentation? Situating the case: The development of this course was informed by literaturefrom a variety of scholarly and industry sources, which reveal connections between DITA, computational thinking,and Rhetorical Genre theory. Specifically, the concept of “layers of abstraction” guides the development of thecourse's structure, allowing students to separate and independently process the various aspects of a structuredauthoring workflow. How the case was studied: The case was studied informally through the experience of theauthors as they developed and taught the course, through informal discussions and structured interviews withindustry professionals, and through student reflections from discussion forum posts from Fall 2012 through Fall2013. About the case: Initially developed with a focus on print manuals and online help, the course began teachingtopic-based authoring in the mid-2000s; however, most enterprise-level editors and tools were cost-prohibitive forstudents and faculty. Furthermore, many computing concepts associated with structured authoring were intimidatingfor an audience of students in an English department. An affordable solution was adopting the open-source DITAstandard, using free trials or open-source editors. The intimidation factor was minimized by designing the coursearound five layers of abstraction that draw on students' previous rhetorical knowledge: Layer 1: Developing qualitydocumentation, Layer 2: Separating content from design, Layer 3: Authoring granular content with XML, Layer 4:Authoring and linking Component Content Management modules with DITA, and Layer 5: Single-sourcing andcontent reuse. This case discusses each layer of abstraction, the associated assignments for each layer, and theresults of each layer based on student feedback. Results and conclusions: Although the course is not universallyloved by students, it has seen many successes and provides a much-needed foundation in component contentmanagement and structured authoring for students who might become technical communicators. The teaching teamhas learned to avoid overemphasizing coding and automation in structured authoring, maintain a solid grounding onwriting principles and good technical communication requirements, and draw upon students' existing knowledgeof genres and their constraints.

Index Terms— Abstractions, computer languages, DITA, documentation, rhetoric, standards, writing, XML.

In “Component Content Management: Shapingthe Discourse through Innovation DiffusionResearch and Reciprocity,” Andersen [1] makesa call to combine efforts from the academia andindustry sides of technical communication to createawareness about the importance of componentcontent-management diffusion. Structured

Manuscript received May 26, 2014; revised October 12, 2014;accepted December 16, 2014. Date of current version January29, 2016.C. Evia is with the Department of English and Center forHuman–Computer Interaction, Virginia Polytechnic Instituteand State University, Blacksburg, VA 24061 USA (email:[email protected]).M. R. Sharp is with the Humanities and CommunicationDepartment, Embry-Riddle Aeronautical University, DaytonaBeach, FL 32114 USA (email: [email protected]).M. A. Pérez-Quiñones is with the Department of Software andInformation Systems, University of North Carolina at Charlotte,Charlotte, NC 28223 USA (email: [email protected]).

IEEE 10.1109/TPC.2016.2516639

authoring is a key component of an effectivecomponent content-management environment,and while many technical communicators stillwork for companies that use unstructuredcontent [2], adoption of a structured authoringworkflow is increasing [3]. In addition, reports onindustry practices [3] and skills required in jobpostings [2] frequently cite the Darwin InformationTyping Architecture (DITA) as the most popularcontent structure in technical communication.This steady increase in demand for the skills ofstructured authoring is not the only reason toteach about DITA in technical communicationcourses, however. McShane claims that embracingtechnologies, such as XML, offer “us—and ourstudents—an opportunity to employ the full-rangeof our rhetorical skills” [4, p. 74]. In other words,it allows our students to critically engage witha new technology, providing them additional

0361-1434 © 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 329

insight into how technologies can affect thecontent they produce and letting them experiencethe rhetorical affordances and constraints of aparticular technological choice. Eble [5] echoesthis connection between rhetorical skills andtechnology by repurposing the traditional canonsof memory and delivery as database/archives andinterface/distribution, respectively. These are thecanons of component content management, wherecontent is stored in the database and repurposed tobe distributed across various versions and media.Our students live in this world. They use structuredcontent, and will most likely be required to createand manage it at some point in their careers.

This teaching case is based on experiences coveringstructured authoring and DITA in the ProfessionalWriting undergraduate program of a majorresearch university in Virginia. The authors haveworked to design and teach a course, ENGL 3814:Creating User Documentation which, in its currentiteration, includes modules and assignments usingExtensible Markup Language (XML), ExtensibleStylesheet Language Transformations (XSLT), andDITA, acknowledging that these are currently themost common ways to structure technical content.The case includes recommendations for programswith modest budgets for software tools.

During the course redesign, ongoing since 2006,the progression of topics related to XML andcomponent content management has representeda learning challenge for most English majors andminors. Whereas basic XML syntax and tags arerelatively easy to teach and learn, the automation ofdeliverables via XSLT and the DITA Open Toolkit ismuch more complicated. The issue is compoundedby a student population with vastly differentcomputing skills. At this large school, known forits engineering programs, some students in theclass have experience with programming languagesfrom other academic disciplines; however, mostEnglish and communication students often avoidanything too “technical.” Furthermore, students areuncomfortable with the separation of format fromcontent because, as noted by McShane [4], theyhave always had control over the format of theirdocuments in Microsoft Word and even in manyHTML-based web environments.

The solution proposed in this case includesincorporating elements of Computational Thinking,which takes an approach to “solving problems,designing systems and understanding human

behavior by drawing on concepts fundamentalto computer science” [6, p. 33]. Specifically, ourapproach focuses on the concept of abstraction,which has been identified as a key componentof Computational Thinking [7], [8] and proposeslearning progressions that lead to a wholeunderstanding of the methodology, tools, andbenefits of a particular system. Abstraction allowsus to separate the “layers” of a particular problem,and work on each one individually without concernfor the others. Then, as the layers are recombined,they work together to solve the problem at hand,much like an algorithm. By separating the processof producing documentation through componentcontent management into several layers ofabstraction that students can tackle one at a time,we allow them to focus their efforts on each layerbefore adding another. By the end of the semester,what once seemed daunting and antithetical totheir experience, makes sense as part of a largerprocess of creating documentation.

This case combines these elements of computationalthinking with applied rhetoric to create replicablemodels in response to the critical mass of industrycontent-management adoption at the heartof this special issue. For readers of the IEEETRANSACTIONS ON PROFESSIONAL COMMUNICATION,this manuscript offers a model for incorporating atechnical standard for documentation (DITA) intoteaching practices of technical and professionalcommunication. Particularly, the case is driven bythe following research questions:

• How can instructors of technical and professionalwriting best teach English/humanities studentsto operate within a structured authoringworkflow?

• How can computational abstraction be combinedwith students' previously acquired genreknowledge to ease their adoption of DITA tocreate technical documentation?

We begin by situating the case within relevantliterature about Rhetorical Genre theory,computational thinking, and DITA. Then, wediscuss how the case was studied as a anexperience-based needs assessment and evaluationreport, followed by an in-depth discussion aboutthe case itself, including how it was developed andexecuted based on five layers of abstraction. Finally,we end by discussing our general conclusionsabout the course, the limitations of the case, andsuggestions for future research.

330 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

SITUATING THE CASEThis section situates the case within theliterature of the research and practice of technicalcommunication and computer science. Afterexplaining how the literature was selected, wethen discuss the key concepts to the course'sdevelopment and evolution: Rhetorical Genretheory, DITA, and computational thinking.

How Literature was Selected In order to providea balanced perspective on the importance ofcomponent content management and DITA, theliterature informing this teaching case was chosenfrom a variety of scholarly and industry sources.Scholarly publications in rhetoric, writing, andtechnical communication that define genres andcollections of genres by the actions they performwithin larger collectives of activity were used todiscuss Rhetorical Genre theory. Publications andtextbooks written by industry experts on technicalcommunication and content management informour discussions of DITA, and scholarly publicationsfrom journals in computer science inform ourdiscussions of computational thinking. Specifically,we referenced literature that defined computationalthinking as a method for problem solving anddiscussed how it can be applied to other disciplines,particularly in nontechnical fields.

Rhetorical Genre Theory According to Bitzer'sdefinition, “a work of rhetoric is pragmatic; itperforms some task” [9, p. 4]. It responds toa certain exigence in order to effect change inthe world. Miller [10] claims that a rhetoricalunderstanding of genre, therefore, must understandit as a typified social action in response to arecurring exigence. Russell used the words“routinized” and “operationalized” [11, p. 546].In other words, genres are a standardized way oftaking action through text, and that standardizationoften includes a unique structure or expectedtypes of content. Genres, however, are defined bythe tasks they accomplish, not by their format.Likewise, the granular topics within componentcontent-management workflows are defined by theactions they perform—their purpose—within thedocumentation [12]. That definition as a specifictype of topic, then, includes some expectations forthe content. For instance, in DITA, tasks tell theuser how to perform a certain action, and they areexpected to contain steps in the imperative mood.Concepts, on the other hand, explain ideas andprocesses and are expected to contain paragraphsmade up of declarative statements. As noted byRobidoux [13], writing technical content requires

the integration of various types of information intocoherent structures that users can predict. Muchlike different genres, these different informationtypes perform different actions individually, butthey work together to accomplish the larger goals ofa documentation project.

The combination of various information typeswithin a component content-management workflowis reminiscent of a genre set. Devitt [14] introducedthe notion of genre sets, which are collections ofgenres used to perform a number of related tasks.Her study described the various genres used bytax accountants to perform their work, includingvarious types of letters, memoranda, proposals,reviews, and protests. Bazerman [15] expanded theconcept of a genre set to a genre system wheremultiple genres from multiple genre sets coordinateto accomplish the goals of an entire system ofactivity. The genre system has a variety of users,all appropriately responding to each other's genreusage with genres from their own genre sets in orderto accomplish the larger goals of the activity system.This collaboration of genres and users mirrors theways in which various topics and authors in aDITA environment collaborate to author content formultiple channels and deliverables.

While undergraduate English students may nothave this depth of knowledge about genres, sets,and systems, they do take courses that touchon the rhetorical understanding of genres. Forthe most part, the students in the course seemto understand the concept of genre and thatdifferent genres accomplish different goals, performdifferent tasks (a layer of abstraction in itself),and have different requirements and constraints.Creating User Documentation was designed tocapitalize upon that rhetorical knowledge in orderto help students separate the various layers ofabstraction that comprise the process of creatingtechnical documentation within a componentcontent-management workflow with DITA.

DITA DITA “is an XML-based, end-to-endarchitecture for authoring, producing, anddelivering technical information” [16, p. 1]. Asdescribed by some of its main architects, DITA

consists of a set of design principles for creating‘information-typed’ modules at a topic level andfor using that content in delivery modes suchas online help and product support portals onthe Web. [16, p. 1]

A brief history of DITA starts by explaining it as astandard published by the non-profit Organization

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 331

for the Advancement of Structured InformationStandards (OASIS). The DITA specification wasdonated by IBM to OASIS in March 2004 [17], “but ithas evolved substantially since that initial donationto encompass a very wide scope of requirementsindeed” [18, p. 6]. Nevertheless, DITA's authoringcore still resides on its main three informationtypes or topics: concept, task, and reference, which“represent the vast majority of content produced tosupport users of technical information” [19, p. 7].Hackos lists the following items as key benefits ofDITA for technical communicators:

• a fully tested DTD or schema for XML-basedauthoring

• a community of developers investing inimprovements to the DITA model

• an open source toolkit you can use to produceyour own output in multiple media withouthaving to invest in proprietary tools

• a thoroughly developed approach to informationdevelopment originating with OASIS and nowencompassing many other companies, largeand small, that find value in a standards-basedapproach [19, p. 9].

These benefits contribute to DITA's popularity asa component content-management solution fortechnical documentation.

Even though DITA is not the only componentcontent-management solution for technicalcontent, it does dominate industry conferencesand publications. In fact, most books about DITAcome from industry practitioners or consultantsand, in some cases, are self-published: XML Pressoffers DITA for Practitioners [18] and DITA for Print[20]; Scriptorium Press published The DITA StyleGuide [17], SDI Global released Practical DITA [21],The Rockley Group published DITA 101 [22], IBMPress offers DITA Best Practices [23], and CIDMhas Introduction to DITA [19]. However, academicdiscussions about this standard and its importanceto the discipline are scarce, and finding materials oran approach to teach it at the college level requiredyears of experimenting and collaboration betweenthe departments of English and computer science.

Computational Thinking In 2007, thismanuscript's first author became coprincipalinvestigator in the community-building project“LIKES: Living in the KnowlEdge Society,”sponsored by the National Science Foundation'sDirectorate of Computer and Information Scienceand Engineering's Pathways to Revitalized

Undergraduate Computing Education [24]. TheLIKES mission was to study the implementationof innovative computing concepts in traditionallynoncomputing disciplines, and develop andimplement “tools and techniques that enablelearning of both computing concepts and theconcepts of the disciplines” [24]. One of the keyterms that emerged during the LIKES workshopswas computational thinking, which was introducedas a problem-solving technique for tasks involvingcomputing automation.

Aho defines computational thinking as “the thoughtprocesses involved in formulating problems so theirsolutions can be represented as computationalsteps and algorithms” [25, p. 832]. Therefore, ifan XML-based component content-managementworkflow was described in computational thinkingterms, it would focus on the computational solutionto the problem of documentation chaos createdby an abundance of unstructured content: astandardized and automated repository of reusablemodules that can adapt to different users' needs.According to Wing, computing is “the automationof our abstractions,” and “the essence ofcomputational thinking is abstraction” [7, p. 3717].The computing literature offers many definitionsfor abstraction; however, most of the work hasbeen in the context of abstraction in computingprograms (see, for example, [26]–[28]) and notin other areas. Wing [7] posits that computingabstractions are richer than in other disciplines, asthey go beyond numerical abstractions and coversymbolic abstractions as well as other types, suchas algorithmic and representational.

In the case of a computational solution forstructured content, abstraction is a combinationof two elements: information representation andseparation of concerns (layers of abstraction). Theuse of abstract representations enables focusingon relevant parts of a problem. The multiple layersof abstractions allow processing (computationally)some layers without concerns about the detailsto be encountered at other layers. Computing isconcerned with automating those abstractions,and in order for that automation to be successful,computational thinking requires understanding notonly of the concepts that each layer of abstractionrepresents, but also of the relationships betweenthe multiple layers of abstraction [7].

In the particular example of developing technicalcontent with DITA, one layer of abstraction is in theability to draw upon authors’ existing knowledge ofgenres to identify unique types of content and apply

332 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

the appropriate conventions of that content type.Another is the specific way to mark up contentto produce documentation within a structuredauthoring workflow. This layer allows a technicalcommunicator to separate content from format. Inorder for the final deliverable(s) to be successfullyproduced, however, the technical communicatoralso needs to understand the relationships betweenDITA and the other layers of abstraction involvedin the publishing process, such as the conceptof single sourcing and the XSLT files and scriptsthat automate a transformation to PDF, HTML,or another deliverable method. Programmingexperiments for novice computing students havefound that computational thinking concepts,such as abstraction, have eased the transitioninto coding [29], which is part of the challenge toEnglish students who are learning to write XMLbut are so used to working in a desktop publishingenvironment such as Microsoft Word.

Just as Wing predicts that deeper computationalthinking within the domain of computer science

may enable scientists and engineers to modeland analyze their systems on a scale orders ofmagnitude greater than they are able to handletoday. [7, p. 3719]

Likewise, the use of computational thinking andabstraction within technical communication (tocreate component content-management repositoriesand deliverables) allows modeling, analyzing, andcreating systems and collections of documents thatare orders of magnitude more complex and moreefficient than ever before. This view is similar tothe abstraction ladder [30], in which each levelreveals additional characteristics of a conceptfor more convenient understanding. Thus, theseemingly daunting concept of “authoring contentin a Component Content Management workflowwith DITA” is divided into easy-to-process levelsbefore introducing any computational automation(to be performed later by the DITA Open Toolkit orspecialized software).

For the purposes of this case, we have identifiedthe following layers of abstraction that comprisethe process of creating technical documentationwithin a component content-management workflowwith DITA:

Layer 1: Developing quality documentationLayer 2: Separating content from designLayer 3: Authoring granular content with XMLLayer 4: Authoring and linking componentcontent-management modules with DITA

Layer 5: Single-sourcing and content reuse

This case reports on how an undergraduate coursemodeled on these layers of abstraction allowsstudents within a Professional Writing programand an English department to capitalize on theirexisting rhetorical strengths while effectivelyimproving their computing skills and learning tocreate documentation deliverables using DITA.

HOW THIS CASE WAS STUDIEDThis section reports on the methods used to assessthe skills that needed to be incorporated intothe course and to evaluate the course over threesemesters. First, we state the research methodology,and then we discuss the requirements for expertand student participants. Next, we discuss how thedata were collected from participants and, finally,how the data were analyzed.

Choice of a Research Methodology This case isprimarily an experience-based needs assessmentand evaluation report with Institutional ReviewBoard's (IRB’s) approval to review, analyze, andreport certain data.

Participants The case benefited from interviewswith technical communication experts fromindustry and academia, and analysis of forumreflections from students taking the course(approved by Virginia Tech’s IRB with protocols11–311 and 13–773, respectively). The primaryobjective for obtaining expert opinions fromacademia and industry was to conduct a needsassessment that contributed to the instructors'ideas on how to revise the course.

Expert participants from industry and academiawere selected based on either their publicationrecord on topics related to documentation, singlesourcing, and DITA, or their presence in onlinesocial media promoting training for structuredauthoring in the workplace or adoption of DITAor similar standards. An initial round of emailinvitations went out to the experts in 2011 andcontinued after nonresponding and decliningparticipants were discarded. Experts were asked forpermission to be quoted by name in publications,and did not receive monetary compensation fortheir participation.

Student reflection posts on their assignments weremandatory and part of their semester grades. Onlystudents enrolled in the 3000-level course CreatingUser Documentation during the Fall 2012, Spring2013, and Fall 2013 semesters were included in this

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 333

case. Our IRB-approved consent form specified thatstudents taking the course during the Fall 2013semester could opt out of having their commentsincluded in publications and reports. For studentswho took the course in the Fall 2012 and Spring2013 semesters, the IRB approved the analysisand quoting of posts as exempt, acknowledgingthat students had already been graded for thoseassignments and tracking them down for approvalwould be a difficult task. As instructors of recordof the course sections included in this case, weensured that participation did not affect studentgrades. Students in the sections covered by ourIRB documents (a total of 53 students: 29 femaleand 24 male) were not asked to identify or describepreviously acquired knowledge of rhetorical genresor technical writing. Instead, we assumed abaseline of skills based on the course's mandatoryprerequisites at the junior level.

How Data Were Collected An important sourceof direction and content for the course comesfrom a research-to-practice loop grounded oninput from experts on DITA, content management,and technical communication in industry andacademia. Some experts include practitioners whohave graduated from our program in ProfessionalWriting, but over the years, more structuredinterviews were conducted with influential namesincluded as follows:

• J.D. Applen, Associate Professor of English,University of Central Florida

• Dave Clark, Associate Dean of theCollege of Letters & Science, University ofWisconsin-Milwaukee

• Don Day, former Lead DITA Architect, IBM• Michael Priestley, Enterprise Content TechnologyStrategist, IBM

• Sarah O'Keefe, President, Scriptorium Publishing• Julio Vazquez, Senior Content Analyst at VasontSystems.

Interviews were conducted, according to individualpreferences, via telephone or email, with questionsrelated to the relevance of structured authoring inthe technical communication curriculum, and ideasand experiences about challenges and solutionsof adopting (or teaching about) an authoringworkflow with DITA or a similar topic-based writingmethodology. Furthermore, Sarah O'Keefe andJulio Vazquez have each visited the course to sharefirst-hand, industry experiences with students,following Don Day's advice to conduct a “brownbag” session with DITA experts.

Documenting student feedback was a key methodfor answering this case's research questionsabout the feasibility of implementing a structuredauthoring workflow with DITA in an English course.Students were required to write reflection blogsor forum posts during and after completing majorassignments in Creating User Documentation. Theposts' requirements were broadly specified in thesyllabus as follows:

These posts are intended to supplement coursereadings and class discussion, particularlyin terms of thinking through the reasonswhy technical communicators would choosecertain genres, authoring techniques, ortools, as well as how those choices affecttechnical communicators, their companies,and their audiences. You may also use thisspace to reflect on issues you encounter in theprocess of completing course assignments andsuggest ways for your peers to avoid similarissues. These posts should be a space for your‘confession’ as a technical communicator intraining.

Although the reflection posts have been a traditionin the course since 2005, only those from coursesthat took place over three semesters in 2012 and2013, approved by the Virginia Tech IRB, areincluded in this case as measures of results foreach layer of abstraction in the process of creatingtechnical documentation within a componentcontent-management workflow with DITA. Thereflection posts were created as a way to documentstudents' success and struggle stories whenadopting a DITA-based documentation workflow.The posts were not used as a tool to grade orassess student projects. Each major project,with or without DITA, was graded with a rubricembedded with detailed, qualitative feedback on themechanical effectiveness of writing, the rhetoricalimpact of the produced documents, and the qualityof technical information included.

How Data Were Analyzed During the process ofreading, commenting on, and grading the reflectionposts, we looked for examples of success stories,struggles, or frustrations related to the adoptionof a component content-management authoringenvironment with DITA. For the purpose of thiscourse, topic-based authoring and single-sourcingtechniques are as relevant to the subject matteras persuasive writing and technical accuracyof instructions; therefore, we did not see DITAas a tool. (Our process was tool-agnostic: moststudents decided to use Oxygen XML for their

334 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

projects and, thus, wrote about problems specificto that editor, but others worked in TextEdit andthe command-line interface using the DITA OpenToolkit and had different tool-related experiences.)The reflection posts worked as a virtual gauge tomeasure the adoption of DITA and understandingof a topic-based writing workflow, and did not havea rigorous analysis component attached to them.

Assuring Credibility and Trustworthiness Wewere as interested in detecting problems as wewere in looking for successful implementationsof DITA authoring workflows. Therefore, ouranalysis of reflection posts attached to each layerof abstraction attempts to cover all facets of thespectrum without bias.

ABOUT THE CASEThis section describes the process of constantrevision of syllabus, materials, and assignmentsin Creating User Documentation with the purposeof reflecting trends and the need from technicalcommunicators in industry and academia. Thesection starts with a description of the problembehind this case, and then it provides a briefdescription of the course itself. Then, we describethe process involved in developing the course as asolution to the problem, with an emphasis on thelayers of abstraction identified to introduce conceptsof component content management, structuredauthoring, and DITA from a computational thinkingand Genre theory perspective.

Problem Originally based on the production ofprint manuals and online help, the course CreatingUser Documentation needed to reflect the needs of a

field as diverse as technical communication,which includes both academics and industryprofessionals and an incredibly diverse array ofinterests, practices, and specialties. [31, p. 43]

Preliminary research in the mid-2000s pointedto topic-based writing and XML-based singlesourcing as key topics for a documentation course.Because most enterprise-level editors and toolswere cost-prohibitive for students and faculty, anaffordable solution was to explore the open-sourceDITA standard. However, many computing conceptsassociated with a DITA authoring workflow (workingon the command-line, writing XML and XSLTcode, customizing transformations, and contentreferences) were too intimidating for an audience ofstudents in an English department.

Brief Description of the Course CreatingUser Documentation is an advanced technicaldocumentation course in a Professional Writingprogram. Students come to this course withrequired experience in composition, introductorytopics in professional writing, technical editing and,most recently, writing for digital media. Since 2006,the course has experienced frequent changes andadaptions based on industry and academia trends.Table I outlines the skills that students currentlylearn in the class through a combination of thefollowing documentation assignments:

(1) Standard operating procedure in a wordprocessor

(2) Template-based how-to documentation in XML(3) Adaptation of the standard operating

procedure into DITA(4) DITA-based user assistance system(5) Multimodal/hybrid how-to project.

Process for Developing the Course This sectiondiscusses the process of determining what toteach in the course as well as how those planswere realized, guided by the layers of abstractionidentified as central to the process of developingcontent in a structured authoring environment.

Determining What to Teach: Creating UserDocumentation has been in the ProfessionalWriting curriculum at Virginia Tech since theearly 2000s. Our involvement started in 2004,and from that year until 2006, the main source ofideas on what to teach came from presentationsand talks at conferences, such as the Summitof the Society for Technical Communication.Structured authoring and DITA were frequentlymentioned at those events. Later on, social mediainteractions, industry conversations, and job adscontinued the conversation on the need for goodcontent-development skills and experience withXML and XSLT as desired characteristics of ourgraduates. From the more in-depth interviewswith experts in industry and academia, however,we learned that structured authoring should beonly one of the strong elements of a well-roundedcurriculum. As a result, our work also includedupdating other courses such as DevelopingOnline Content (formerly titled Writing for theWeb) and creating Writing and Digital Media,which is a course on multimodal authoring andsocial media management. At the same time,the topic of abstraction as a key component of aproblem-solving approach for learning computingconcepts appeared on publications and the LIKES

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 335

workshops, sponsored by the National ScienceFoundation, mentioned earlier in this manuscript.

From those sources of information, we weredetermined to introduce structured authoring forcomponent content-management environmentsthrough a combination of quality writing fortechnical documentation, a clear separation ofcontent and design to enable single-sourcing andcontent reuse, a brief introduction to XML andXSLT to understand the concept of granularity, andapplication of the DITA as a standard for technicalcontent.

Realizing the Plans: A computational thinkingapproach to the course helped solve the combinedproblems of creating technical documentation inDITA with nonexpensive or open-source tools inan English course. As a problem-solving method,computational thinking comprises the automationof abstractions and, in this case, the automationswere handled by the DITA Open Toolkit and freeor inexpensive DITA-aware editors. Dividing thecourse into five layers of abstraction simplifiedcomputing-related tasks for the students whiletaking advantage of their previously acquiredrhetorical skills.

Although all assignments in the course are basedon principles of minimalism in documentation andstructured authoring, not all of them are about DITAand component content management. For example,the final assignment in the course has been anexperiment in creativity and documentation, and indifferent semesters, students have created how-tovideos (with or without elements of entertainmenteducation); procedural comic books and web comics[32]; and optional hybrid DITA topics combiningtext, audio, and video [33].

Furthermore, teaching Creating User Documentationinvolves a constant search and evaluation ofreading materials and software applicationsthat make topics related to DITA, structuredauthoring, and component content managementmore accessible for an audience of undergraduateEnglish majors and minors. In the textbookarea, the course has moved since 2004 from anoriginal focus on manuals as its default genreto topic-based writing and publication. Whereasthe required textbook in 2004 was The UserManual Manual: How to Research, Write, Test,Edit & Produce a Software Manual, by Bremer[34], in the fall of 2014, the required texts wereEvery Page Is Page One: Topic-Based Writing forTechnical Communication and the Web, by Baker

[35], Developing Quality Technical Information, byCarey et al. [36], and Introduction to DITA: A UserGuide to the Darwin Information Typing Architecture,by Hackos [19]. Regarding tools and software,the evolution has been from HTML created inNotepad or TextEdit to Oxygen XML Editor, withtrial periods of XMLSpy, Serna XML, XMLmind,and even command-line transformations with theDITA Open Toolkit. These evaluations have beeninformal and based on student input, budgetaryrestrictions, and recommendations from socialmedia and trade publications. For each layer ofabstraction that guided the course's development,the following sections provide a general descriptionof each layer, its assignments, and its results basedon student feedback.

Layer 1: Developing Quality Documentation: Thefoundational structure of the course, beforegetting into any XML-related projects, is in the“technical writing process” (with an enhancedemphasis on audience analysis) introduced byPringle and O'Keefe [37]. In a 10-step process,Pringle and O'Keefe summarize the requirementsof a technical content project, with tasks thatinclude identifying deliverables, planning andoutlining, creating content, editing, and interactingwith subject–matter experts, among others.Furthermore, based on a recommendation fromIBM's Michael Priestley, a second foundationalmodel for the course came from the characteristicsof quality technical information listed by Carey etal. [37] as follows:

• Easy to use:• Task orientation• Accuracy• Completeness.

• Easy to understand:• Clarity• Concreteness• Style.

• Easy to find:• Organization• Retrievability• Visual effectiveness.

The main objective in this layer is to introducetechnical documentation as a process and not justa product. Both models provide structure for theplanning, authoring, and revising performed bystudents in the course. Following these models,students learn to see their technical content asan asset and learn about return on investmentand interacting with clients and experts. This layeralso includes an introduction to user experience,

336 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

TABLE IQUICK FACTS ABOUT THE SOLUTION

as students develop user personas and conductinterviews for audience analysis. Tools used in thislayer are those with which students are alreadyfamiliar, including a word processor (usuallyMicrosoft Word) and spreadsheets, which allowstudents full control over content and format.Therefore, within this layer of abstraction, studentsare able to focus on creating quality documentation,before adding additional technologies and tools tothat process.

Assignments. In this layer, students create adocumentation plan following a model by Pringleand O'Keefe [37]. They include detailed personasand usage scenarios as they envision a standardoperating procedure (SOP) for a series of tasksaffecting their experience as students or residentsof their college town. The layer ends with theactual SOP with a memo reporting on a qualitativeusability test with three potential users.

Results. Students commonly expressed theirsurprise at how useful the documentationplan and personas actually became whilewriting the SOP, which implies that they werebeginning to understand the importance ofplanning in the technical writing process aswell as the necessity of a focus on the user andtask-orientation—fundamental aspects of qualitytechnical documentation.

I think that writing the doc plan for the firstassignment was a great start to the classbecause it gave me a good foundation for therest of the course. Writing outlines and doc

plans will be necessary for the rest of theassignments throughout the semester, andlearning this skill early will help me develop mytechnical writing skills further

wrote a student from the fall 2012 semester.

A student from the fall 2013 semester added thefollowing:

The documentation plan provided a quickand easy checklist of topics that I needed tocover in my SOP. Referring to the plan afterI thought I was finished helped me avoidneglecting topics that I initially claimed thatI would cover, which shows its purpose andusefulness. I now understand how having adetailed documentation plan can help writersand clients avoid misunderstandings andkeep clients from making up unreasonableexpectations for the final product.

Layer 2: Separating Content From Design: Pringleand O'Keefe classify “the methodology fordeveloping technical content into four levels:”chaos, page consistency, template-based authoring,and structured authoring [37, p. 40]. In theprevious layer, students developed page-consistentdocuments in a word processor, and this secondlayer is about the process of abstraction behind theseparation of content from design through the useof templates. Baker warns about the difficulties ofteaching this layer in the following statement:

One of the hardest things about movingtechnical writers from desktop publishing to

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 337

structured writing is persuading them to giveup responsibility for how the final outputlooks. Writers will keep looking for ways tospecify layout, even in markup languagesspecifically designed to remove layout concerns.They understand their jobs in terms of theresponsibilities their old tools imposed onthem. [35, p. 87]

Tools used in this layer move beyond a wordprocessor and focus on open-source code-friendlyeditors (including Komodo Edit, Notepad++,TextWrangler, and others) to start writing in HTMLand XML. Students remain responsible for writingquality documentation, but this layer requiresthem to relinquish control of format in order tocreate content within an XML and template-basedenvironment.

Assignments. Students develop content in XML forpresentation on the web. They receive templatesalready created with CSS and then XSLT, whichthey cannot modify in this layer. The introductionto XML includes discussions and examples aboutits everyday uses, from RSS to KML, VoiceXML, andeven iTunes. The main project is a flat-file databaselike a collection of movies or a catalog of books.Their XML “collections” at this point consist of onelong content file with many instances of an elementin a format similar to the following:

<movie mine = “yes”><title>Back to the Future</title><director>Robert Zemeckis</director><actor>Michael J. Fox</actor><actress>Lea Thompson</actress><genre>Science fiction</genre></movie>

The XSLT templates can then be modified to sortor filter movies by student-determined criteria,and they can also customize CSS files to changeappearance, but only after they have completed thecontent for their projects.

Results. While some students found XML moredifficult to learn than others, they still understoodthe importance of using markup languages inprofessional writing contexts. According to astudent from the fall semester of 2012, “codingmeans the Internet; the Internet means publication,and publication means viewership.”

A student from the spring 2013 semester authoredthe following “confession:”

And even now, I continue to despisecoding—except for in the context of creatinguser documentation. In the context of thisclass, for some reason, I don't hate it. In factI enjoy it. Partially because I am learning auseful skill, but mostly because this course istaking a task that I have always associated withcomputer engineering and software expertiseand presenting it to me in a way that connectscode to the concepts I am familiar with:document layout, headings, chunking, and thelike. When I was able to start thinking of usingXML as simply another medium to create userdocumentation with, and not a magical formulathat makes computers work their power, I wasable to near-instantly wrap my head aroundthe styles and languages of XML. I think it'srather amazing how simply shifting the contextof coding seemed to snap it to another part ofmy brain entirely and allow me to overcome amajor hangup in a day.

Furthermore, students seemed to understandthe benefits of separating content from format.Many forum posts acknowledged the advantagesof transforming content into multiple outputs butalso acknowledged that it allowed them to focus onthe content. A student from fall 2012 said that theseparation of content from format “allow(s) us tofocus on what we know best—writing.” Later, thesame student added that “[s]tructured formattingthrough XML allows our treasured content to bequickly and easily manipulated to suit the needs ofthe platform and our audience.”

Many other students, however, were moreaccustomed to HTML, which allows more controlover the format and presentation of information.In the fall of 2012 and the spring 2013, forumpostings reflected a desire to learn more about theways to customize the transformations of XMLfiles. As a result, the fall 2013 course incorporatedmore detailed explanations of XSLT and CSS forcustomization.

Layer 3: Authoring Granular Content With XML:In their model of document engineering, Glushkoand McGrath define granularity as “the extent towhich a system contains separate components” [38,p. 633]. Component content-management-basedsystems depend on this granularity for linkingand reuse. However, before students are exposedto those advanced topics, they need to master thisthird layer of abstraction. At this point, studentsare between simple XML catalogs and specialized

338 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

grammars such as DITA. In an interview, SarahO'Keefe recommended

that XML should be a core component of(technical writing) curricula. In addition toDITA, there should be some coverage of otherstandards, such as DocBook, S1000D, NLM,and perhaps TEI. This will give students abetter understanding of DITA as standardrather than the standard.

Thus, before embracing a specific standard,this layer has the objective of introducingcomponent-based XML authoring and filteringwith the purpose of presenting modules created inXML as equivalents of more traditional technicaland professional writing genres. Tools used inthis layer include XML-specific editors such asXMLSpy and Oxygen, and students are encouragedto modify content and templates as they interactwith web-aimed transformations combining XMLand JavaScript routines.

Assignments. Students author one or more XMLdocuments with related content that then willbe linked in different ways to adapt to users'needs thanks to XSLT filters and JavaScript DOMmanipulations. In different semesters, the mainproject has been a detailed online cookbook withrecipes reflecting a specific theme or type of cuisine(the XML files move around the repeating element

and its many children), a summary of aspecific college football game (with instances ofthe element following an account ofand ), and quick-start guides for electronicsand gadgets (with the DITA-like element ofincluded in a parent ).

Results. As the students advance to morecomplicated linking and transforming processeswith XML, the layers maintain a connecting threadfrom the rhetorical purpose of each element orunit of content. A step within a recipe or a playwithin a quarter performs a certain action and,therefore, should be written in a certain way.Students' existing knowledge of genres and theirconventions allowed them to understand thislayer of abstraction more readily. For example, inthe forums across all three semesters, studentsdiscussed the differences between differentelements in each unit. In the unit on XML, theyhelped clarify for each other the difference betweentips and notes, and in the DITA-specific units, theydiscussed the conventions of short descriptions,steps, and unordered steps. This genre-basedfocus on the purpose and conventions of each

element naturally led to a focus on the purpose andconventions of each topic within the DITA standard.

This layer was really challenging for some students,as this post from the fall 2013 semester shows:

The XML project was frustrating and confusingat first It took a while to understand thatwe had to write the XML code, then also havecorresponding XSLT and HTML pages for eachmodule. However, the process wasn't so badafter I got the hang of it. It was convenient thatI was able to copy & paste the majority of mycode and only had to change a few numbersor words depending on which gadget/moduleI was writing for. Writing the code itself wasdefinitely the most time-consuming part of theproject, but it wasn't difficult.

Similar success-after-struggle stories from thatsemester include statements like “once I saw mywork show up in the browser, I had a boost ofconfidence and encouragement. That's what makesXML so fun, but frustrating at the same time. It iseither right or wrong. There is no in between withXML,” and

the XML project wasn't exactly fun or enjoyable,but it was rewarding to see the end results. Ithought of the project like a puzzle too becauseit had so many different pieces that had tobe put together correctly for the page to workproperly.

These stories reveal that while students initiallyhad trouble producing a large amount of code,they eventually began to see its utility in creatingthe various pieces of granular content that eachperforms a specific function in the final deliverable.

Layer 4: Authoring and Linking ComponentContent-Management Modules With DITA: Atthe end of layer 3, students were satisfied withthe results of their homebrewed structuredauthoring environment. This next layer starts bycomparing their XML template to an internationalstandard by asking “what if someone in anotherschool or another country decided to addrecipes/games/gadgets to your project?” As anopen source, XML-based, international standard forstructured authoring, DITA is at the core of layer4. Building on the concept of genre, the discussionof DITA starts with understanding the usage casesfor the topics of concept, task, and reference.In addition, the module explores how studentsbuild genre systems through topic-linking and thedevelopment of DITA maps to filter and organizetopics for specific deliverables. The overall objective

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 339

of this layer is to make students comfortablewith topic-based writing and the functions andresponsibilities of a DITA author. Tools usedinclude open-source and inexpensive DITA-awareeditors and the DITA Open Toolkit.

Assignments. An effective introduction to DITAstarts with content with which the students arealready familiar. In this layer, the main project isto convert the SOP created in a word processorduring layer 1 into valid DITA topics. Conversionof legacy documentation to DITA is not a simplecopy-and-paste process. Rockley et al. point outthat

the ease of the conversion is only as good asthe quality of the content to begin with. If theauthors were consistent in how they createdcontent and content used templates there is agood chance that the content can be convertedprogrammatically, but sadly that is usually notthe case. [22, p. 59]

Students need to envision their SOP, originallycreated for print delivery, as a web or mobilesystem. Sections become topics, and tables ofcontents transform into maps as students createcontent inventories to see what's missing orwhat needs to be cut in the conversion. Finaldeliverables include a PDF version of the SOP and acorresponding web help system.

Results. Students were able to transfer theirknowledge of genre conventions by thinking of eachtopic type within DITA as its own genre with itsown, unique constraints and structure. A fall 2012student wrote that

one of the most helpful elements ofunderstanding DITA is grasping how there aredifferent types of markups based on the type ofinformation the document contains. With eachdifferent type of document I typed up for bothassignments, I put myself in a certain frame ofmind for completing each type of markup basedon the main three categories.

Students were even able to see how the DITA topicswere useful in projects that took place outsideof a structured authoring environment (videotutorials, for example), demonstrating that theyunderstood the relationships between multiplelayers of abstraction well enough to transfer thatabstraction to documentation in other media. Astudent from the spring of 2013 said, “I found itvery interesting that without even realizing it we areusing the same concepts from our DITA projects in

our tutorial, and how everything seems to matchup with a DITA element.”

Furthermore, within this layer students beganunderstanding the sequence of assignments forthemselves, not necessarily as layers of abstraction,but as a logical sequence that allows them to buildon what they learned earlier in the semester. A fall2013 student wrote, “I agree that the DITA projectwas much easier than the XML project. I think thatthis was the case because we already were familiarwith XML before we started this project. If we haddone this project before doing the XML project, Ibelieve it would have been overwhelming.”

Layer 5: Single-Sourcing and Content Reuse: Alongstanding definition of single-sourcing oftechnical documentation is the process of

writing information once and using it manytimes. It does not mean writing it and thencopying and pasting it into another source, ormodifying the information for different needssuch that you have multiple sources [39, p.189]

Additionally, Bellamy et al. mention that “in theworld of DITA, reuse is about writing content onceand reusing that content whenever it's needed” [23,p. 183]. The purpose of this layer is to introducestudents to the concepts of single-sourcing andreuse. In DITA, maps represent an accessible entrypoint to single-sourcing, and reuse can be achievedeasily through content references (conrefs). Theconref “is a method of transclusion, which meansthat you can include a topic or part of a topic inanother topic by using a reference” [23, p. 184]. Inthis layer, students write content once, and thencreate multiple deliverables for different audiences,using some unique topics and some reused topics.Tools used in this layer include DITA-aware editorslike Oxygen with advanced functions, such asconditional processing and filtering. In addition,students learn about customization of DITAdeliverables, incorporating personalized CSS filesto their web products, and exploring the use ofplug-ins for PDF and Open Document Format forOffice Applications (ODF).

Assignments. Following a recommendation fromJulio Vazquez, formerly of IBM and SystemsDocumentation, Inc., the main project in this layeris a fictional scenario asking students to preparebreakfast recipes in DITA for different audiences.The assignment introduces “Armando the veganroommate” and a bacon-loving mother who visitsa student. Both need breakfast instructions and

340 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

recipes, but Armando requires a web help ormobile-friendly deliverable respecting his dietaryrestrictions, while “mom” wants a PDF with plentyof meat. The assignment description specifiesthat each deliverable “should include at least fourdifferent recipes, and the point of reuse is thatsome overlap is expected, or in this case, required:you will need at least two recipes with a conditionalingredient (or ingredients) that could be adaptedfor vegan and non-vegan users (example: replacetofu for eggs).” A recipe can have audience-basedconditional processing instructions as follows:

<step audience=“mom”>

<cmd>Flip bacon with tongs after 5 minutes ofcooking.</cmd>

</step>

<step audience=“armando”>

<cmd>Flip vegan bacon alternative with tongs after7 minutes of cooking.</cmd>

</step>

Also, with a conref, a step written once,

<step id=“stove”> <cmd>Place pan on stovetop.</cmd></step>

can be called in another topic,

<step conref=“t-bacon.dita#bacon/stove”>.

This assignment still requires students to createquality documentation within a DITA environment,but the added complexity of reuse and conditionalprocessing at this layer requires that they envisionways to reuse and adapt that content acrossvarious types of deliverables for different audiences.

Results. After mastering layer 4, most studentsseemed to appreciate the flexibility andcustomization options afforded by layer 5. A fall2012 student, referencing Rockley et al.’s [22]discussion of content collections as a “library ofcontent,” said that

the DITA maps function like the Dewey Decimalsystem, organizing and calling upon files asthey are needed. The structured authoring styleof DITA allows for content to easily be reusedand the maps are the defining factor in how thetopics are organized for use.

In the spring of 2013, a student commented onthe following:

After two separate assignments with DITA, Ican see the reason why people may use it. It'san easy way to single-source your informationand be able to re-use it. Obviously, that's agreat ability to have. I'm currently planningmy Writing for the Web (a senior-level course inthe program) final paper that has to do withresponsive designs. I've talked about how itis way more efficient to create one responsivedesign that works for mobile phones, tablets,and computers. This is sort of a similarsituation here. You can create one thing of‘content’ and then source it out to a PDF,e-book, or HTML file. Then, whenever youhave to make a change, you can make it onetime. Then, transform your file to the differentformats and you're good. So I can definitelysee the benefit of that.

Talking about the final class project with DITA,which by fall of 2013 incorporated team-basedwork, a student said

“I couldn't agree more that this project ishelpful. Many professional/technical writerswork in teams and collaborate on projects.That is why DITA is so great to use becausemaps and topics can be linked and re-usedwith others so easily.

A classmate added that

after doing the DITA Breakfast project, thisproject was super easy to do. I appliedeverything that I learned in the first project,but on a bigger scale. I also got really into thecustomizations for the WebHelp and the PDF.After customizing the project it seemed like areal document that could actually be used bysomeone instead of just a project for a class.

These final reflections confirm that separating thevarious layers of abstraction involved in the processof creating technical documentation with DITA notonly allows students to more easily adapt to a DITAenvironment but also to understand the rhetoricalaffordances of the technology.

CONCLUSIONS, LIMITATIONS, AND SUGGESTIONSFOR FUTURE RESEARCHThis section presents conclusions from the teachingcase, introduces suggestions for improvement andadoption in similar professional and technicalwriting programs, and forecasts ideas for future

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 341

research on the production and delivery of technicaldocumentation authored in DITA.

Conclusions Creating User Documentation isa difficult course to take and to teach. Fromthe instructor's perspective, it requires constantupdates to reflect trends and preferences inindustry and research in academia. Therefore,it involves keeping an eye on the proceedingsand reports of conferences and meetings on bothsides of the technical communication professionalspectrum. From the student perspective, theassignments can be challenging but rewarding.However, the course is not universally loved,and more than once a student's reflection hasimplied that it looks more like a Computer Scienceclass than an English course. A solution to thisproblem is to avoid overemphasizing the coding andautomation side of DITA and XML and to maintaina solid grounding on writing principles and goodtechnical communication requirements.

The course is a modest attempt to answer thecall broadcasted by Andersen [1] for unitingefforts in industry and academia to facilitatethe adoption and understanding of componentcontent-management solutions. Some graduatesof the program become technical communicationpractitioners and join the workforce with a solidknowledge of structured authoring and DITA. Byintroducing an abstraction-oriented computationalthinking approach, the layers build over time andgive students opportunities to master one levelbefore addressing the next one, and it helps thatautomation comes on the DITA Open Toolkit andstudents do not need to learn a programminglanguage.

A student from the fall 2012 semester summarizedher experiences during the course in the followingpost, which was appended to the first assignment's(SOP) forum:

Okay so this is extremely late coming, but Ithink it's interesting to compare where we werefor this assignment to the place we are in nowwith DITA. I say this because we are actuallyusing information we learned from this sectionand the assignments that we did for this partof the class to complete websites now. I canhonestly say that I didn't think that I would beable to make half of the things that we havein class and I'm kind of impressed with myselfconsidering how bad I am at technology (myown fault since I don't like to sit around andplay with technology and I get frustrated easywith it).

By acknowledging that the skills learned in thecourse's first layer of abstraction are still relevantto the final layer, this student is confirming thatthe concept of abstraction that guided coursedevelopment did, in fact, allow students to buildeach new skill on top of those learned in previouslayers.

Limitations A major limitation of this case isthe lack of information about our graduates whobecome technical writing practitioners. As ofApril 2014, we do not have a complete databasedocumenting placement and employment forProfessional Writing graduates. From informalconversations on social media, we only have threedocumented cases of graduates who work orworked with structured authoring in the workplace:two of them work with MadCap Flare and one hasconducted work with DITA and DocBook.

Also, one semester does not allow for a fullexploration of DITA specialization, including theLearning and Training Content Specialization, oran additional layer of abstraction for advancedcustomization with plug-ins. And budgetarylimitations also complicate the adoption of moreuser-friendly, enterprise-level DITA editors, or evena more robust solution like Adobe FrameMaker.However, the deliverables created by students inthis course reflect proper balance of rhetorical,persuasive writing, accurate technical content,and modest computing implementation. Studentsfrom Creating User Documentation frequentlypresent about the process of creating their DITAassignments at the Virginia Tech undergraduateEnglish research conference, and some haveparticipated in related internships and researchprojects in their senior semester.

For possible adoption of this case in similaracademic programs, a limitation could be thecourse sequencing at our institution that includesCreating User Documentation. The course'sprerequisites covering fundamentals of Genretheory, and more advanced offerings on web designand content strategy, create a unique environmentfor our students. From a technology perspective,possible adoption of this case in similar programscould take some of the abstraction levels andapply them to topic-based authoring in a Wiki-likeenvironment, or even on HTML for the web. DITAis not a software program, and students wholearn about topics and single sourcing usingDITA can apply those principles to blogs and evenword-processing-based documents.

342 IEEE TRANSACTIONS ON PROFESSIONAL COMMUNICATION, VOL. 58, NO. 3, SEPTEMBER 2015

Suggestions for Future Research Reportingfrom a discussion about research topics thatcould unite industry and academia in technicalcommunication, Benavente and Clark [40]identified a few lines of inquiry that were deemedas relevant for both sectors. These lines includereception studies, content strategy studies, andmetrics/measurement studies, among others.All of those lines could connect to the process ofauthoring technical documentation in DITA. Inaddition, academia needs to conduct research toinnovate the field and not just reproduce whatindustry is already doing. It could be interesting

to continue exploring the production of DITAdeliverables for small screens and mobile devices,the development of hybrid topics with multimodalcontent, and a combination of how-to videos withprinciples of structured authoring and componentcontent management.

ACKNOWLEDGMENTS

Katherine Mawyer conducted some of the interviewsquoted in this paper as part of an undergraduateresearch project supervised by the first author.

REFERENCES[1] R. Andersen, “Component content management: Shaping the discourse through innovation diffusion research

and reciprocity,” Tech. Commun. Quart., vol. 20, no. 4, pp. 384–411, 2011.[2] K. Schengili-Roberts. (2013). What size are DITA-using firms and where is DITA going?DITAWriter.

[Online]. Available: http://www.ditawriter.com/what-size-are-dita-using-firms-and-where-is-dita-going/?utm_medium=twitter&utm_source=twitterfeed

[3] S. O'Keefe and A. Pringle, The state of structure, 2011, Cary, NC, USA, 2011.[4] B. J. G. McShane, “Why we should teach XML: An argument for technical acuity,” in Content Management:

Bridging the Gap Between Theory and Practice, G. Pullman and B. Gu, Eds. Amityville, NY, USA: Baywood,2009, pp. 73–85.

[5] M. F. Eble, “Digital delivery and communication technologies: Understanding content management systemsthrough rhetorical theory,” in Content Management: Bridging the Gap between Theory and Practice, G. Pullmanand B. Gu, Eds. Amityville, NY, USA: Baywood, 2009, pp. 87–102.

[6] J. M. Wing, “Computational thinking,” Commun. ACM, vol. 49, no. 3, pp. 33–35, 2006.[7] J. M. Wing, “Computational thinking and thinking about computing.,” Philos. Trans. A. Math. Phys. Eng.

Sci., vol. 366, no. 1881, pp. 3717–3725, Oct. 2008.[8] College Board, Computer Science: Principles Computational Thinking Practices Big Ideas, Key Concepts,

Supporting Concepts, 2012.[9] L. Bitzer, “The rhetorical situation,” Philos. Rhetor., vol. 1, no. 1, pp. 1–14, 1968.[10] C. R. Miller, “Genre as social action,” Q. J. Speech, vol. 70, no. 2, pp. 151–167, May 1984.[11] D. R. Russell, “Rethinking genre in school and society,” Writ. Commun., vol. 14, no. 4, pp. 504–554, 1997.[12] J. Hackos, Content Management for Dynamic Web Delivery. Hoboken, NJ, USA: Wiley, 2002.[13] C. Robidoux, “Rhetorically structured content: Developing a collaborative single-sourcing curriculum,” Tech.

Commun. Quart., vol. 17, no. 1, pp. 110–135, 2008.[14] A. J. Devitt, Writing Genres. Carbondale, IL, USA: Southern Illinois University Press, 2004.[15] C. Bazerman, “Systems of genres and the enactment of social intentions,” in Genre and the New Rhetoric, A.

Freedman and P. Medway, Eds. New York, USA: Taylor & Francis, 1995, pp. 79–104.[16] D. Day, M. Priestley, and D. Schell. (2005). Introduction to the Darwin Information Typing

Architecture: Toward portable technical information. IBM developerWorks. [Online]. Available:https://www.ibm.com/developerworks/library/x-dita1/

[17] T. Self, The DITA Style Guide: Best Practices for Authors. Research Triangle Park, NC, USA: ScriptoriumPublishing, 2011.

[18] E. Kimber, DITA for Practioners Volume 1: Architecture and Technology. Laguna Hills, CA, USA: XML Press,2012.

[19] J. Hackos, Introduction to DITA, 2nd ed. Denver, CO, USA: Comtech Services, 2011.[20] L. W. White, DITA for Print: A DITA Open Toolkit Workbook. Laguna Hills, CA, USA: XML Press, 2013.[21] J. Vasquez, Practical DITA. Raleigh, NC, USA: Write Spirit, 2009.[22] A. Rockley, S. Manning, and C. Cooper, DITA 101: Fundamentals of DITA for Authors and

Managers. Schomberg, ON, Canada: Rockley Group, 2009.[23] L. Bellamy, M. Carey, and J. Schlotfeldt, DITA Best Practices: A Roadmap for Writing, Editing, Architecting in

DITA. Upper Saddle River, NJ, USA: IBM Press, 2012.[24] E. Fox, C. Evia, W. Fan, S. Sheetz, and C. Zobel. (2007). Living In the KnowlEdge Society (LIKES),NSF Award

Abstract. [Online]. Available: http://www.nsf.gov/awardsearch/showAward?AWD_ID=0722259&Histori-calAwards=false

[25] A. V. Aho, “Computation and computational thinking,” Comput. J., vol. 55, no. 7, pp. 832–835, 2012.[26] J. Bennedsen and M. E. Caspersen, “Abstraction ability as an indicator of success for learning computing

science?,” in Proc. 4th Int. Workshop Comput. Educ. Res., 2008, pp. 15–26.

EVIA et al.: TEACHING STRUCTURED AUTHORING AND DITA 343

[27] P. Frorer, M. Manes, and O. Hazzan, “Revealing the faces of abstraction,” Int. J. Comput. Math. Learn., vol.2, no. 3, pp. 217–228, 1997.

[28] B. Haberman, “High-school students' attitudes regarding procedural abstraction,” Educ. Inf. Technol., vol.9, no. 2, pp. 131–145, 2004.

[29] H. C. Webb and M. B. Rosson, “Using scaffolded examples to teach computational thinking concepts,” in Proc.44th ACM Tech. Symp. Comput. Sci. Educ., 2013, pp. 95–100.

[30] S. I. Hayakawa and A. R. Hayakawa, Language in Thought and Action, 5th ed. Orlando, FL, USA: Harcourt,1990.

[31] R. Andersen, S. Benavente, D. Clark, W. Hart-davidson, and C. Rude, “Open research questions foracademics and industry professionals?: Results of a survey,” Commun. Des. Quart., vol. 1, no. 4, pp. 42–49,2013.

[32] C. Evia, M. Stewart, T. Lockridge, S. Scerbo, and M. Perez-Quiñones, “Structured authoring meets technicalcomics in techcommix,” in Proc. 30th ACM Int. Conf. Design Commun., 2012, pp. 353–354.

[33] C. Evia, S. Healy, and T. Lockridge, “Evaluating a workflow for authoring multimodal DITA,” in Proc. 31stACM Int. Conf. Design Commun., 2013, p. 185.

[34] M. Bremer, The User Manual Manual: How to Research, Write, Test, Edit and Produce a SoftwareManual. Concord, CA, USA: UnTechnical Press, 1999.

[35] M. Baker, Every Page is Page One: Topic-based Writing for Technical Communication and the Web. LagunaHills, CA, USA: XML Press, 2013.

[36] M. Carey, D. Longo, M. McFadden Lanyi, E. Radzinski, S. Rouiller, and E. Wilde, Technical Writing 101: AReal-World Guide to Planning and Writing Technical Content, 3rd ed. Research Triangle Park, NC, USA:Scriptorium Publishing, 2009.

[37] A. Pringle and S. O'Keefe, Developing Quality Technical Information: A Handbook for Writers and Editors, 3rded. Upper Saddle River, NJ, USA: IBM Press, 2014.

[38] R. J. Glushko and T. McGrath, Document Engineering: Analyzing and Designing Documents for BusinessInformatics and Web Services. Cambridge, MA, USA: MIT Press, 2005.

[39] A. Rockley, “The impact of single sourcing and technology,” Tech. Commun., vol. 48, no. 2, pp. 189–193, 2001.[40] S. Benavente and D. Clark. (2013). Using research to examine beliefs and habits,CIDM Inf. Manage. News.

[Online]. Available: http://www.infomanagementcenter.com/enewsletter/2013/201303/second.htm

Carlos Evia is an associate professor of TechnicalCommunication in the Department of English at VirginiaPolytechnic Institute and State University, Blacksburg, VA, USA,where he also directs the program in Professional and TechnicalWriting. He conducts research on multicultural user assistancefor the Virginia Tech Centers Human–Computer Interaction andInnovation in Construction Safety and Health. Prof. Evia is amember of the DITA Technical Committee with the Organizationfor the Advancement of Structured Information Standards.

Matthew R. Sharp is an assistant professor of Communicationin the Humanities and Communication Department atEmbry-Riddle Aeronautical University, Daytona Beach, FL, USA.With a background in professional writing, web development,fundraising, and enrollment management, his research analyzesorganizational activity systems and their mediating technologiesfrom both cultural and rhetorical studies perspectives.

Manuel A. Pérez-Quiñones is an Associate Dean of the Collegeof Computing and Informatics and Professor in the Departmentof Software and Information Systems at the University of NorthCarolina at Charlotte, Charlotte, NC, USA. His research interestsinclude personal information management; human–computerinteraction; user interface software; as well as educational,cultural, and diversity issues in computing. He is a member ofthe Coalition to Diversify Computing.