Research Division Reflection

It’s hard to believe that the first year fellows have already completed our first rotation within a division. I was nervous to begin the fellowship in the research division, since I’m not super-technical (I was rightly told that I can no longer claim to not be a “technology person”), but I have had quite a learning experience. I learned new skills – I can now effectively explain to someone what a plugin actually does and how it works – and went out of my comfort zone in learning Python.

In our first week, we began with PressForward. After playing around with the sandbox site, I installed the PressForward plugin onto my dev site to get a better handle of how it worked. Once I was more comfortable with the logistics of the plugin I moved on to working as an editor-at-large of Digital Humanities Now. It was incredibly interesting to see how the plugin can be used for academic purposes and how it aggregates and organizes content. I was astounded by the quantity of content that was part of the all content feed, especially since a disproportionate amount of the posts were not related to digital humanities.

In our second week, we shadowed Tuesday’s editors-in-chief, Amanda and Mandy, and watched them go through the process of examining the articles under review and deciding which pieces should be published. Prior to Thursday, I familiarized myself with the editors-at-large corner and read several editors’ choice articles. I especially enjoyed reading “Thoughts on feminism, digital humanities and women’s history,” since my area of research is women and gender. On Thursday we were editors-in-chief, which was such a fun experience.

It was beneficial to begin work with PressForward from the ground up. We started with the sandbox, moved on to seeing how the plugin worked for DH Now, and then used the plugin to publish an issue of DH Now. It is a fantastic tool for disseminating often overlooked material to a wide audience and for collecting and curating information. Overall, I had a positive experience with PressForward and DH Now.

After PressForward, we started learning Python through the Programming Historian lessons. I had minimal experience using HTML, CSS, and XML to create a website from scratch when I was in library school, but programming is not something I am comfortable with. At first Programming Historian was fairly easy and the first few lessons seemed straight-forward, but once I got past the “Manipulating Strings in Python” I started to feel lost. After completing those lessons I moved onto the Zotero API lessons. These were more difficult for me to comprehend, especially since, as Stephanie pointed out, they are not in layman’s terms. With help from Jordan and Spencer, I was able to get through the lessons using the sample Zotero library.

I cultivated my own Zotero library and then went back through the API lessons using it instead of the sample in order to see how much of the lessons I could understand on my own. I was successfully able to get through the first two lessons, which was very exciting. I ran into some problems with the third lesson when Text Wrangler was not reading the URLs from the first two items in my library. It was working when I used the sample library because the URLs are links to simple HTML pages, but the links in my library are linked to more complicated sites, such as the source’s record in EBSCO. Jordan had discovered another problem earlier with the user and group tags, and I went into GitHub and reported both of our problems. I am excited to see how I will use Python in the future with other digital humanities projects.

It was an illuminating contrast to work with both PressForward and Python and to see how the latter influences the former. I can understand why we began in the research division since the technical skills we learned are necessary in order to have a solid foundation and understanding of digital history.

Reflections on My First Year as a DH Fellow

This has been a very eventful and exciting first year for me working within the Center for History and New Media. As I mentioned in my introduction post, I came in with a vague familiarity with CHNM from my previous institution. However, being immersed within it helped me to form a much greater understanding of what the Center does, what the possibilities of Digital History are, and where I fit into that picture for my career and studies.

First, the inclusion of the DH Fellows into the different divisions throughout the year was extremely helpful for me. Through this process, I learned how the Center works, as well as the different projects that were available. I came out way more knowledgeable about Omeka, PressForward, and how projects such as Sea of Liberty come to life.

Beyond the actual projects, one of the primary benefits of being a DH Fellow was the establishment of communication and networks that I feel will be incredibly helpful for me continuing forward. With the various projects, such as our project that scraped THATCamp data (those posts are here in five parts-one, two, three, four, five), we were able to communicate with people within CHNM for assistance. That process was immensely helpful, as well as the Digital Campus podcast sessions that let me engage in the process of thinking about the implications of current events for digital history, as well as participating in an experience working with major players in the field.

As well as working with people that are currently doing Digital History, I was also able to work with my peers that were in the Clio 1 and 2 classes. The DH Fellows, through the suggestion of Spencer Roberts, created the DH Support Space that met every week. This support space was a great addition to the DH Fellowship, as it allowed us to both use the tools we had learned in class and at CHNM, as well as to assist other students in issues they had. This let me grow my skills and learn in the process of trying to help other students with their assignments and projects.

Lastly, I believe one of the most useful aspects of my first year was the seminar. First, we researched what made a digital humanities center, and which ones are still around today. This then led into researching the history of the Center for History and New Media. Each of the first year DH Fellows was given a project to research, and I picked teachinghistory.org. This project let me dive into grants, documents, and how our projects played into the history of the Center. It was very helpful in learning where the Center fits into the larger context of Digital History, which is significant (and helpful!) for my minor field of Digital History.

I feel that overall, this entire year has been incredibly useful and helpful for me, and I have learned so much from working here. I look forward to assisting others this year, mentoring, and continuing to learn the process of doing Digital History through my assignment as a DH Fellow in my second year.

Reflections on the Spring Semester and Year 1 as a Digital History Fellow

It seems like just yesterday we walked into the Center for History and New Media a bit unsure about what our first year as DH fellows would entail. Looking back it has been an extremely rewarding and valuable experience. Last fall we blogged about our rotations in both the Education and Public Projects divisions. In the Spring we moved to Research for seven weeks where we worked on a programming project for THATCamp and on the PressForward project before moving onto a seminar about the history of CHNM. I want to use this blog post to reflect on the spring semester and look back at the year as a whole.

Our first stop during the spring semester was the Research division. We began our seven weeks by taking on a topic modeling project which aimed to mine all the posts from the THATCamp individual websites and blog about the process. As we used the Programming Historian to learn python (or at least attempt to), we thought a lot about tools and the scholarly research process. We discussed Zotero as a tool and the values and community behind THATCamp as a training network and community for the Digital Humanities. Although we struggled with the programming aspect of this assignment and managed to miss important concepts behind Topic Modeling, the assignment gave us some insight into what kinds of challenges and opportunities topic modeling holds. From this project I learned first hand the importance of understanding the black box behind Digital Humanities tools. After finishing with our topic modeling project we moved onto the PressForward project. We spent a week working as Editors-at-Large and helped second year fellow Amanda Morton with her Editor-in-Chief duties. Thinking about scholarly gray literature and measuring reception of scholarly works on the internet we also spent time researching AltMetrics.

At the end of the three rotations we were left with a very clear understanding of each division, its current and past projects, the audiences it creates for and the overlap between each division. We then began a seminar with Stephen Robertson that explored the history of RRCHNM. In this seminar we tried to understand how RRCHNM developed over the years into its current state and how RRCHNM fits into the larger history of the digital humanities. Beginning with an overview of what a Digital Humanities Center is and how its defined, we collaboratively looked at all 150 centers in the United States and tried to get a sense of the different models that exist and just how many actually fit the definition of a digital humanities “center” as defined by Zurich. What we realized is that the Center for History and New Media stands out from other Digital Humanities centers due to its unique attachment to the History Department but also because of the origins of the center and because of Roy Rosenzweig’s vision.

After we defined just what a center was and looked at the different models, we started to look at the origins of RRCHNM and try to create a genealogy of the different projects and trace the development of the center. Each of the first year fellows took a different major project and traced its history through grant documents and reports. I read up on Zotero in its different iterations and learned a lot about how Zotero was originally conceived as well as how it has grown, expanded, and changed since 2004.

I think one of the things that has been immensely useful for the first year fellows is the ways much of our work at the center was paralleled by our coursework. In the PhD program at GMU we’re required to take a two course sequence in digital history. The first sequence focuses on the theory of Digital History and the second is largely a web design course that introduces us to the basics of HTML and CSS. Often times the topics in Clio I related directly to why we were doing at the center and the dual exposure allowed us to see the application of things we had discussed in Clio first hand.

At the suggestion of Spencer Roberts, the fellows decided to begin a Digital History Support Space in the Fall. The support space offers “advice, guidance, and assistance for students doing digital history projects.”  Every Monday from noon to 5pm (and sometimes even on weekends) we met with students taking the Clio courses, offered advice about and brainstormed potential projects, helped to debug code, and offered a space to work where help was available if needed. We were able to draw on experience from the center and offer advice about what kinds of tools are available and where resources might be found. We weren’t experts but working with the other students in our Clio classes was equally beneficial. It left me with a better understanding of the issues, topics, and tools discussed in our classes. As many of the PhD students move onto Clio III: Programming for Historians with Lincoln Mullen this fall, I’m looking forward to continuing the Support Space.

The fellowship has been structured in such a way that each element has built on itself to provide us with experience and an understanding of digital history, digital humanities, and the debates, methodologies, and histories of the discipline. This fall I’ll be working in the Research Division on the PressForward project and helping to manage both Digital Humanities Now and the Journal of Digital Humanities. Our first year as Fellows has gone by extremely fast but I’m looking forward to beginning a new year and moving into the role of mentor to the new group of DH Fellows.

THATCamp Mallet Results

We have spent the last few weeks working to build a python script that would allow us to download and prep all of the THATCamp blog posts for topic modeling in MALLET (for those catching up, we detailed this process in a series of previous posts). As our last post detailed, we encountered a few more complications than expected due to foreign languages in the corpus of the text.  After some discussion, we worked through these issues and were able to add stoplists to the script for German, French, and Spanish.  Although this didn’t solve all of our issues and some terms do still show up (we didn’t realize there was Dutch too), it led to some interesting discussion about the methodology behind topic modeling.  Finally we were able to rerun the python script with the new stopwords and then feed this new data into MALLET.

Continue reading

Unexpected Challenges Result in Important and Informative Discussions: a transparent discussion about stripping content and stopwords

As described in previous posts, the first year Digital Fellows at CHNM have been working on a project under the Research division that involves collecting, cleaning, and analyzing data from a corpus of THATCamp content. Having overcome the hurdles of writing some python script and using MySQL to grab content from tables in the backend of a WordPress install, we moved on to the relatively straightforward process of running our stripped text files through MALLET.

As we opened the MALLET output files, excited to see the topic models it produced, we were confronted with a problem we didn’t reasonably anticipate and this turned into a rather important discussion about data and meaning.

Continue reading

Pre-processing Text for MALLET

In our previous post, we described the process of writing a python script that pulled from the THATCamp MySQL Database. In this post, we will continue with this project and work to clean up the data we’ve collected and prepare it for some analysis. This process is known as “pre-processing”. After running our script in the THATCamp database all of the posts were collected and saved as text files. At this stage, the files are filled with extraneous information relating to the structure of the posts. Most of these are tags and metadata that would disrupt any attempts to look across the dataset. Our task here was to clean them up so they could be fed into MALLET. In order to do this, we needed to strip the html tags, remove punctuation, and remove common stopwords. To do this, we used chunks of code from the Programming Historian’s lesson on text analysis with python and modified the code to work with the files we had already downloaded.

Continue reading

Extracting Data from the THATCamp Database Using Python and MySQL

This week we’ve continued to work on building a python script that will extract all of the blog posts from the various THATCamp websites. As Jannelle described last week, our goal was to write a script that downloads the blog posts in plain text form and strips all of the html tags, stopwords, and punctuation so that we can feed it into MALLET for topic modeling and text analysis. After several long days and a lot of help from second year fellow Spencer Roberts, we’ve successfully gotten the code to work.

Continue reading

Spring Semester in Research and a THATCamp Challenge

The spring semester is here and the first year DH fellows have begun our rotation into the Research division of CHNM.

To get the ball rolling, we spent a week working through the helpful tutorials at the Programming Historian. As someone new to DH, with admittedly limited technical skill and knowledge, these were immeasurably useful. Each tutorial breaks content into smaller, less intimidating units. These can be completed in succession or selected for a particular topic or skill. While there is useful content for anyone, we focused our attention on Python and Topic Modeling with the aim of solving our own programming dilemma.

Our central challenge was to extract content across the THATCamp WordPress site to enable us to do some text analysis.

Continue reading

Reflections: Year Two, Semester One

As the first term of 2013-14 closes, it seems appropriate to reflect on the experiences of the Digital History Fellows. Last year, our first cohort of DH Fellows spent the first semester meeting with Dan Cohen, learning the history of the center, discussing current projects, and thinking about how digital history is practiced. We spent our second semester working in each of the divisions for five weeks, and then decided in which division we would like to work in the second year. Although there was no specific requirement that we take positions spread across the three divisions, we were drawn in different directions. From the first days of the fellowship, Ben Hurwitz was most comfortable in Education and quickly entrenched himself at their community table. He now works on various educational projects, including the Popular Romance Project. Amanda Morton worked closely with Fred Gibbs before he relocated to New Mexico, which helped her transition into Research, where she works on Digital Humanities Now and related PressForward projects. Spencer Roberts was drifting toward Public Projects before the summer started, and settled in once the center received a grant to work with the National Park Service to revamp their War of 1812 site.

This year we welcomed three new members into the fellowship, bringing our total number to six. The second cohort follows a different schedule in their first year, so Amanda Regan, Anne Ladyem McDivitt, and Jannelle Legg stepped directly into the mix at RRCHNM, splitting their semester into seven-week blocks in Education and Public Projects. During those weeks, they have written reflective posts about the projects to which they’ve contributed, all of which can be found here. Next term, they will spend a block in Research before moving into a final seminar with Stephen Robertson.

Continue reading

A Bit of Reflection on Pressforward Projects

It’s interesting to be on the other side of the production of something like DHNow/JDH. Not only does sorting through material for each offer a unique opportunity to explore current events and conversations in the digital humanities, but this process also encourages deeper examination of blog posts and white papers to pull out threads of argument and evidence that can be used to connect disparate conversations across fields. Archaeologists and manuscript historians share common interests with those working in hard sciences and linguistics, although their work is rarely presented in the same forum. Part of what JDH adds to the DH community is this willingness to collect and edit work from across several disciplines and present them as part of a united DH culture.

I’ve learned, as a graduate student working on these projects, that being a part of this collecting and collating work requires a willingness to explore a wide-range of interests, and to read blog posts, white papers, and poster projects that have little to do with my own projects or areas of expertise. For example, most of the content for JDH comes from the pool of content chosen for Editors’ Choice features on DHNow, a selection process that requires Editors-in-Chief for a chosen week to read through content nominated by a group of editors-at-large whose experience in the DH community is variable. The job of the EC is to sort through these nominations, pulling out relevant job postings, conference and event announcements, calls for participation, and useful resources, then picking one or two items to feature as the Editors’ Choice for the Tuesday and Thursday of that week.

The selection of these Editors’ Choice items is left largely up to the EC for the week. There are guidelines, of course. These featured items need to be of substantial length, usually more than 500 words or 20 min. in video/presentation playback, and should make a relevant, substantive, and perhaps even provocative argument that adds to or initiates a conversation in the field. Since DHNow only links to these posts — there’s no editing involved — they should also be well-written and, if necessary, thoroughly cited. White papers and articles are generally only posted if they haven’t been published in other journals or periodicals.

While these guidelines are helpful, on good weeks Editors-at-Large nominate several pieces that meet the requirements, leaving final selection up to the EC for the week. Each of us have our own idiosyncrasies, of course, and our own areas of interest can influence our choices. We do also take into account how many times our options have been nominated, and we pay attention to that additional level of interest as well as checking for comments (in the PressForward plugin) that explain why our guest editors nominated individual items. What results is a crowd-sourced, yet still curated, publication that feeds into JDH.

Recent changes to the DHNow site — in both the sections dedicated to the Editors-at-Large and the main content pages — will hopefully encourage our guest editors to engage more in the content selection process. It will be interesting to see if new editors (and returning participants) start to leave more comments or more feedback to provide us with a better understanding of how they are selecting content to nominate. The other reason behind the redesign, beyond helping out current editors, was to pull in more outside editors. The more participants we have, the more feeds are nominated to be added to the plugin, and the more exposure both we and our editors have to the ongoing conversations and arguments circulating within the DH community. By encouraging the creation of a more engaged community, we are also pushing for more interdisciplinary participation in the field, bringing scientists, librarians, archaeologists, archivists, historians, and others into a community whose make-up should result in bigger and better projects and perhaps, a more solid sense of a DH identity.