
AntConc...

 

What is AntConc, you may ask?

 

AntConc is a free, multi-purpose corpus analysis toolkit that houses a comprehensive set of tools, including a “concordancer, word and keyword frequency generators, tools for cluster and lexical bundle analysis, and a word distribution plot.” The software was created by Laurence Anthony, a Professor of Applied Linguistics at Waseda University in Japan. You can find the software here:

 

https://www.laurenceanthony.net/software/antconc/

 

Some of AntConc’s tools overlap with Voyant’s. For example, the Trends tool in Voyant displays similar information to the Plot tool in AntConc. However, whereas Voyant charts the progression across the documents on a graph, with the documents shown in comparison to one another, the Plot tool shows it as a series of horizontal bars that can be compared. Voyant’s tool is more visually appealing and displays the documents, so I will be using that tool instead of AntConc’s Plot.

 

The AntConc tools I intend to use are KWIC, collocate, cluster, and word cloud. The KWIC (Key Word in Context) tool displays each instance of the word or words searched and shows how they are used in the surrounding sentence or context.
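To make the KWIC idea concrete, here is a minimal Python sketch of a keyword-in-context search. It is not how AntConc itself is implemented; the function name `kwic`, the `width` parameter, and the sample sentence are all made up for illustration.

```python
import re

def kwic(text, keyword, width=30):
    """Return (left context, hit, right context) for each whole-word match.

    Hypothetical helper for illustration only; AntConc's own KWIC tool
    has many more options (sorting, wildcards, and so on).
    """
    rows = []
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    for m in pattern.finditer(text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        rows.append((left, m.group(0), right))
    return rows

# Made-up sample text, not taken from my corpus.
sample = "The court heard the case. Later the court adjourned."
rows = kwic(sample, "court", width=12)
for left, hit, right in rows:
    # Right-align the left context so the keyword lines up in a column.
    print(f"{left:>12} | {hit} | {right}")
```

Lining the keyword up in a single column is what makes it easy to scan how a word is used across many passages.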

 

 

The collocate tool looks for words that appear near the word searched more often than other words in the corpus, though not necessarily directly next to it, so they don’t form a traditional cluster. The cluster tool displays word chunks, that is, words directly next to the word searched, along with the number of instances of each. And the word cloud, which I can only assume most people are familiar with, displays a “cloud,” or artistic-style grouping, of the most common words within the corpus.
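The difference between collocates and clusters can be sketched in a few lines of Python. This is only a rough illustration under my own assumptions: the function names, the `span` and `size` parameters, and the sample sentence are hypothetical, and the sketch uses raw counts, whereas AntConc ranks collocates with a statistical measure.

```python
from collections import Counter
import re

def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def collocates(tokens, node, span=5):
    """Count words within `span` tokens on either side of each `node` hit."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            window = tokens[max(0, i - span):i] + tokens[i + 1:i + 1 + span]
            counts.update(window)
    return counts

def clusters(tokens, node, size=2):
    """Count contiguous chunks of `size` words that begin with `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node and i + size <= len(tokens):
            counts[" ".join(tokens[i:i + size])] += 1
    return counts

# Made-up sample text, not taken from my corpus.
tokens = tokenize(
    "The court ruled today and the court ruled again before the court adjourned"
)
near = collocates(tokens, "court", span=2)
chunks = clusters(tokens, "court", size=2)
print(near.most_common(3))
print(chunks.most_common())
```

Here "the" counts as a collocate of "court" because it falls inside the window, while the clusters only ever contain words that sit directly beside "court".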

 

These are the tools I have run my corpus through, and I will be using their results to analyze against the traditional research I have completed.

 

Although it was initially difficult to add a stopwords list (I had to download a newer version), I was able to upload my stopwords file, which also helped narrow down the results.

 

If you would like to read more about this software, you can visit the link below.

 

https://www.laurenceanthony.net/research/iwlel_2004_anthony_antconc.pdf
