TaxonWorks Together 2021, it's about community

01 Feb 2022 - Deborah Paul, Matt Yoder

The theme of building community framed TaxonWorks Together 2021 (TWT). Two plenary sessions, User Perspectives on TaxonWorks, and Building a Community Around Your Taxonomic Needs, provided our contributors and invited speakers a public forum to share and discuss their TaxonWorks (TW) experiences. Over 150 people registered and we had participants from countries around the globe. A common theme from users was the need for public facing pages for data in their TW databases. TaxonWorks software specifically for this purpose (i.e. purposefully separate from the database software) is now in development with a multi-year set of milestones offering specific functions being released over the next 3+ years. Please submit your ideas for the content and function of these public pages!

Guests!

We opened our week with a series of user perspectives. Hear directly from Chris Dietrich, Lily Hart, Jim Woolley and John Heraty, Heidi Hopkins, Kojun Kanda, Michelle Kohler, and Maria Marta Cigliano about TW from their points-of-view. Their talks are available at YouTube.

Our plenary speaker, Beckett Sterner (Arizon State University, https://orcid.org/0000-0001-5219-7616) presented: Building a Community Around Your Taxonomic Needs which stimulated a great discussion around such topics as group data curation, data pooling strategies, and ideas about what makes a community long-lived. His session is available on the TaxonWorks' YouTube. Insights from this session offer that inclusiveness, collective benefit, credit and attribution are all important variables for sustainable data portals. In our user perspectives session, contributors noted how much they appreciate the benefit of being able to work as a group using TW. Local to global, the Species File Group builds software, tools, and a commons that support a taxonomist’s individual and collaborative workflows to use and manage complex, detailed nomenclatural, taxonomic, and specimen-related data. All of us look forward to findings from Beckett’s recently awarded NSF grant: Explaining Differential Success in Biodiversity Knowledge Commons.

Why not Excel, or, reasons for using TaxonWorks?

One of our participants asked “why TW and not Excel?” This, and various related discussions throughout the week raised a number of reasons as to why TaxonWorks might be useful (beyond just not being Excel). Discussion highlights ("the “learning curve” is worth it") included: :

Excel limits field length, is “two-dimensional” and is not a database while
TW offers handling complex data
- Nomenclature, distributions, literature, and biological associations
- Automatic generation of taxonomic keys and taxon pages
- Built-in hard and soft validation for data fitness
TW offers collaboration, working with many folks together in the same database
- 5 natural history collections currently manage their data in TW
  - DwC Archive import and export is quickly making TW a compelling digitization solution
  - People with loans can digitize and return data for integration or receive digitized data with specimens
- Remote team-work during the pandemic
- High-school students entering data
- TW allows for the distribution of tasks, projects that once depended on one person, now can be done by many. The Universal Chalcid Database curators manage data from over 58K sources.
- Private community curation has positive aspects
- Participants noted the sharing of some data across TW projects, for example sources (i.e. literature), people data, and some geo data.
TW development team
- Responsive and data standards aware
- Long-term funding support for TW
- Listens to the needs of the community but also
- Educates users if there’s a better practice
- Willingness to customize where feasible
- Individuals can to add custom fields (more suitable and less lossy than “notes”)
TW software
- A tool that is open and not proprietary
- Makes data sharing easier and more sustainable in the short and long term
- Allows anyone worldwide to contribute to the features and functions of the software and API |

Darwin Core

Another exciting activity during the week offered us a chance to show our newly released Darwin Core Importer. With this tool, you can use your DwC-A file (or just the occurrence.txt file) to upload data into TW. Features and functions are being added and input for and testing of this new function welcome! During our BBQ, all were introduced to GitHub, and we learned a few new tricks and some software development etiquette about emojis (e.g. use them to provide feedback on comments). In our digitization session, Tommy McElrath showed us the power of several key features such as the Comprehensive Digitization (all relevant fields in one screen) and the TW “grid digitizer” (at right). And just-in-time for TWT Tommy created a new video series about Digitization Using TW.

Nomenclature

TaxonWorks manages nomenclature distinctly separate from the descriptions of those things being named. In the Nomenclature session moderated by Dmitry Dmitriev, several speakers talked about their experiences with TW to manage large nomenclatural resources, for example, the LepIndex and Universal Chalcidoidea Database (UCD). Jim Wooley walked us through how the UCD group collectively manages data from 58K sources (aka publications) in TaxonWorks. Donald Hobern noted that LepIndex needs structured sustained community curation and some enhanced TW features to make it easier to improve the LepIndex dataset in subsets (e.g. by family). Several people note that often multiple nomenclatural lists may exist and would like to see more tools that make it possible to share more of the common data where desired / feasible / and logical to improve taxonomic workflows.

Matrices

During the session on Matrices we heard from Sergei Tarasov and Kojun Kanda. Sergei is working to integrate Phenoscript with TW. Phenoscript is the first programming language for describing species (see https://github.com/sergeitarasov/PhenoScript/wiki). In all collaboration, somehow you need to track progress, yes? Kojun cleverly recognized that the matrices software can be used to do this and showed us how.

Technical

At the Technical session “It’s Developer Time”, programmers and users covered a wide range of topics. Live-coding in Ruby happened to fix and execute a script needed for querying the TW API. We heard about the User Interface (UI) and User Experience (UX) from users who would like some different options for roles and permissions as well as options to customize the UI including a customized workflow. Our Global Names developer, Dmitry Mozzherin heard that users would like to be able to compare (“do a diff”) across 2 or more taxonomic name checklists. He's built gndiff as a response. Another researcher uses the research-grade images coming from iNaturalist, and wants to pull them into TaxonWorks via the iNat API. And for those who want to jump in to help code the UI/UX of TW, developer José Luis Pereira produced a great video to get you started! See Using VueJS with TaxonWorks, part 1 on the TW YouTube.

Connect with us

Throughout TWT we reiterated the ways to connect with us. As part of our commons, we’re connected 24/7/365 via GitHub and Gitter, and of course at our weekly sessions on Wednesdays. For 2022 we updated our weekly meeting topics to: Nomenclature, Digitization, Matrices/Observations, Technical, and Open (where any topic is welcome). Please join us! It’s easy (see information under Current on the Events page for details).

To revisit, or in case you missed it, you’ll find the meeting materials linked on the event page and some recordings on YouTube.

Coming soon on Propodeum

We’ve had multiple requests (before and during TWT) for details of the underlying data models behind TaxonWorks. Great news! With a lot of hard work on the part of the TaxonWorks developers, these details are coming to light. You can get a sneak peak (though changes are certain) at docs.taxonworks.org.

Links

Back