Arctos: nature of ID

Created on 16 Jul 2019  Ā·  247Comments  Ā·  Source: ArctosDB/arctos

http://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID is a mess.

Ideally I think we should say something about the evidence used for the ID, but that doesn't seem possible. Minimally we can not say the same thing a bunch of different ways?

NATURE_OF_ID | Documentation|Hu?
-- | --|--
ID of kin | An identification based upon the identification of another related individual, often the mother of an embryo. Such a specimen should have at least one individual relationship.|spiffy
ID to species group | Within a genus, some groups of closely related species are referred to by the species name of one widespread or well known species within the group.|Hu? This (and much more) can be done with eg, Sorex {Sorex cinereus complex}, I don't think we need multiple ways of doing that.
curatorial | An identification determined by qualified personnel assisting with collection management including collection managers, curators, trained students, staff and others who may not be experts in the group in question but have some knowledge of relevant taxonomy.|this looks functionally identical to student
erroneous citation | The specimen has been cited in refereed scientific literature by this name but this name is clearly wrong. This situation arises mostly from typographical errors in catalog numbers.|spiffy - or not, but it happens
expert | The determiner is a person recognized by other experts working with the taxa in question, or the regional biota.|this looks functionally identical to student, or at least heavily overused. One agent is an expert on 7112 different taxa! Users have the tools to decide who they consider experts and act on that.
field | A determination made without access to specialized equipment or references.|"Looks like a moose" may be necessary, but I'm still not sure how it's not yet another version of 'student.'
geographic distribution | Specimen is assumed, on the basis of known geographic ranges, to be the species or subspecies expected at the collecting locality. The specimen has not been identified to species or subspecies by comparing it to other subspecies within the genus or species. "It's probably that species because that species lives there and we know what species lives there because we're museums and telling people where stuff lives is what we do....."|still looks circular to me
legacy | The identification has been transposed from an earlier version of data that did not include identification metadata. In this case the date of the determination is the date that the data were transposed, and the determiner is unknown.|I think we're stuck with the concept, but maybe "unknown" is a better label
molecular data | An identification made by a laboratory analysis comparing the specimen to related taxa by molecular criteria, generally DNA sequences.|yay us!
photograph | "Field ID" or perhaps "morphology" is probably always more important than this.|"student" version 5
published referral | The specimen has been specifically determined to be of a particular taxon in a publication that describes or re-describes that taxon, but the specimen has no type status. Such a specimen record should include a citation, and the determiner(s) of record should be among the authors of the publication. (This means nothing.)|I still have no idea what this means
revised taxonomy | This designation is appropriate only in the presence of an earlier identification. It implies that the specimen has not been reexamined, and only that a different taxonomic name is being applied. In most cases this results from taxonomic synonymization of names.|we're stuck with this
student | Specimen has been identified by a person using appropriate references, knowledge, and/or and tools, but not by an expert. This is a broad use of the term student.|I think maybe everyone hates the label, but the concept seems accurate for the vast majority of our identifications
type specimen | This particular specimen has been described in the literature by this name. The specimen record should contain a citation of the appropriate literature,and the determiner(s) of record should be among the authors of the publication.|yay us!

@ccicero

Function-CodeTables Function-TaxonomIdentification In progress Priority-Critical

All 247 comments

We've talked about this - #1093 maybe we can start there.

We've already gone through the "student" discussion in
https://github.com/ArctosDB/arctos/issues/1093. Let's not go there again.
Reviving another related request from the earlier thread:
4) Dusty, please get rid of "ID of kin" as default nature of ID; leave
blank value and require selection from list.
… https://github.com/ArctosDB/arctos/issues/1093#

On Tue, Jul 16, 2019 at 3:34 PM Teresa Mayfield-Meyer <
[email protected]> wrote:

We've talked about this - #1093
https://github.com/ArctosDB/arctos/issues/1093 maybe we can start there.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCHILW7P52LDNPLCJTP7Y5FNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2CHFUQ#issuecomment-511996626,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBESUSF5LIIOQC7UY63P7Y5FNANCNFSM4IEIGLOA
.

I agree strongly that we should remove ID of Kin as the default value - too
many errors result. Leave it blank & require it be filled.

student vs expert vs curatorial - I sometimes have a hard time deciding
between my own IDs - are they expert or student? The addition of curatorial
as an option makes this decision making even worse. Do I use expert or
curatorial (or student)? I've argued before and will again that the least
ambiguous set of choices for this would be 'expert' vs 'non-expert'.
Non-expert is the clear choice for any IDs not made by an expert and*
knowing whether it was made by an expert or not is the most important thing
to know about an id*. If someone told me these IDs were made by students
and these others by curatorial staff I would wonder about my own curatorial
staff, who are often students... and myself.. and I consider myself a
student of many taxa but an expert in just a few. Also, I'm an expert at
getting ALL insects ID'd to order, but only an expert at getting some
insects ID'd to species.

So I argue for replacing both 'student' and 'curatorial' with 'non-expert'

I agree that 'field' is ambiguous - if an expert is in the field and is
confident of what the ID is without recourse to keys then use 'expert'
instead. If it's not an expert, use 'non-expert'. Add remarks to the
remarks if desired.

I've never used 'ID to species group' and agree with Dusty that we have
better ways to show that - plus we still would want to know if it was an
expert who made that ID to species group and this option prevents us from
storing that information.

Geographic distribution - climate change is making some geographic regions
more tolerable than they had been previously.... perhaps this should be
restricted to remarks. Eg. an expert who decides what it is and then adds
this information to the remarks is a more valuable ID than just using
'geographic distribution' as if anyone (non expert vs expert) could safely
make an identification of the same reliability.

I think photograph should be in remarks also - an expert ID based on a
photo is a lot different than a non-expert ID based on a photo.

published referral- oftentimes we don't know who did the IDs of the species
listed in papers (sadly) and it's not safe to assume it was one or more of
the authors. This is similar to legacy but makes it clear the ID came from
a publication, not a label on a specimen in front of you.

revised taxonomy - this is important to record but I dislike that it hides
information about expert vs non-expert ID... for example, if I loan
specimens to an expert and get them back and later one of the species is
moved to a new genus I update all the records with the new name and myself
as the determiner and use revised taxonomy. But when these data are shared
to aggregators or downloaded by users, they rarely or don't see that the
specimens were ID'd by an expert... they only see the most recent ID by
myself, a non-expert, who didn't even look at the specimens, I just updated
the name.

I've long wished this could be altered somehow - perhaps needing new fields
to store information about this while retaining the ID of the last person
who actually looked at the specimens... somehow.

my 2 cents.

-Derek

On Tue, Jul 16, 2019 at 1:48 PM Mariel Campbell notifications@github.com
wrote:

We've already gone through the "student" discussion in
https://github.com/ArctosDB/arctos/issues/1093. Let's not go there again.
Reviving another related request from the earlier thread:
4) Dusty, please get rid of "ID of kin" as default nature of ID; leave
blank value and require selection from list.
… https://github.com/ArctosDB/arctos/issues/1093#

On Tue, Jul 16, 2019 at 3:34 PM Teresa Mayfield-Meyer <
[email protected]> wrote:

We've talked about this - #1093
https://github.com/ArctosDB/arctos/issues/1093 maybe we can start
there.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCHILW7P52LDNPLCJTP7Y5FNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2CHFUQ#issuecomment-511996626
,
or mute the thread
<
https://github.com/notifications/unsubscribe-auth/ADQ7JBESUSF5LIIOQC7UY63P7Y5FNANCNFSM4IEIGLOA

.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM7DHT2UIOFAKT35FW3P7Y6ZPA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2CIHVI#issuecomment-512000981,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMYXR4HGVLAQUAZHO33P7Y6ZPANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I'll say what I said before:

I don't believe that we are qualified to make an assessment of the expertise of any identifier. Wouldn't it be better to only provide objective data? Most of the options are pretty much that.

Most of these designations are opinions as Derek demonstrated with his "I am an expert, I am a student" argument. The rest would be supported with a remark or citation. If the ID was made in a publication (revised taxonomy), then there should be a citation - no need to say any more. I feel like this field is unnecessary in a way and I don't see an equivalent DWC Identification term.

for example, if I loan
specimens to an expert and get them back and later one of the species is
moved to a new genus I update all the records with the new name and myself
as the determiner and use revised taxonomy. But when these data are shared
to aggregators or downloaded by users, they rarely or don't see that the
specimens were ID'd by an expert... they only see the most recent ID by
myself, a non-expert, who didn't even look at the specimens, I just updated
the name.

I've long wished this could be altered somehow - perhaps needing new fields
to store information about this while retaining the ID of the last person
who actually looked at the specimens... somehow.

This is an aggregator issue. Anyone looking at the record in Arctos could see what happened. Until GBIF&Co can handle the multiple IDs, I'm not sure we can do anything to make things look better on their end. I suggest you open an issue with iDigBio about it and start a wider discussion.

I knew this was eerily familiar...

Looks like we're more or less agreed on 'legacy'-->'unknown' - I'll make that change if nobody stops me soonish.

remove ID of Kin

That sort of thing gets lost in these kind of Issues. I moved it to https://github.com/ArctosDB/arctos/issues/1868, which I think I'm just going to have to do in production, which is probably going to be disruptive. I re-prioritized and will try to sneak it in ASAP.

I don't believe that we are qualified to make an assessment of the expertise of any identifier. Wouldn't it be better to only provide objective data?

YES!

I don't think we're going to convince anyone that a dedicated volunteer necessarily produces lower-quality IDs than random 'students,' or that becoming a grad student magically makes someone an "expert" for everything, everywhere, using any technique (and I can't see any other way to interpret some of the data in Arctos).

Conversely, about anyone could do a bang-up job of feeding a sequence to genbank and recording whatever flops out.

Geographic distribution - climate change

Excellent point.

as if anyone (non expert vs expert) could safely make an identification of the same reliability.

That's my reading of the current documentation - "looks like a woodpecker - we're HERE - must be THAT woodpecker...."

I consider myself a student of many taxa but an expert in just a few.

That I think is getting close to useful, but I also suspect others working with those taxa know that and don't need to be told. That is, technique+agent gets at "expertness" (or the lack thereof), and we don't need an explicit pigeonhole.

There's also some weird "experience factor" in there. The first ID you make with some new technique/taxa/whatever probably isn't as good as your 10,000th. I have no idea what to do with that....

non-expert

That makes sense only if we're contrasting it with "expert," and I obviously see dubious value in "expert."

argue for replacing both 'student' and 'curatorial' with 'non-expert'

I'm not crazy about the vocabulary, but completely agree with the direction.

published referral

Fair enough, and I suppose that more or less reflects the structure of citations. Minimally they didn't reject the possibility that the thing is whatever they called it.

I think that's another reason to hate "expert" (or ranking identifiers in general) as well - I'd guess a fair number of these get entered as "expert" (they're publishing on the taxa, after all), if we're using the terms arbitrarily then they can't really DO STUFF. Maybe that's just a matter of documentation.

revised taxonomy

Interesting point. I might be tempted to bring the agent and technique (if we had such a thing...) over from the previous ID, which would preserve that, but there may be some unjustified assumptions in that too - eg, maybe the 'expert' DOES NOT think "Myodes" is just another way of spelling "Clethrionomys" and your curatorial assumptions about the relationship between those names add or subtract something from the original. That would also remove the fact that the revision is your assumption. You may be right in that we're missing something, but I can't quite see what.

I have not read this whole thread in detail, but I think we are confounding who is making the ID (expert, student, etc.) with the basis for the ID.

Can we make this more about the basis for the Identification:

  • Geographic: Identification based on range maps or textual descriptions of geographic distribution. [We use this all the time for subspecies of birds and mammals, so have to keep it]

  • Molecular: Identification based on comparative genetic data.

  • Phenotype: Identification based on qualitative or quantitative examination of external characters. [field would apply here]

  • ID of kin: as above.

etc. (I'd need to think about this for every existing value, and there may be new ways we're not thinking about).

The 'who' part (expert or non-expert) should be through the agent making the determination. e.g., agent Peter Pyle (=expert) made identification based on phenotype on a specific date.

Thanks Carla - I like this direction, I just didn't know anyone else did!

From your comments, here's a first pass at mapping. If we go here we could probably do better for many of the unknowns - eg, IDs by {person} or before {date} or ... could get "upgraded" to something more specific.

old | new|Hu?
-- | --|--
ID of kin|ID of kin|secondary; see below
ID to species group|unknown
curatorial|unknown
erroneous citation|unknown|This belongs in https://arctos.database.museum/info/ctDocumentation.cfm?table=CTCITATION_TYPE_STATUS, not sure what it's doing here too??
expert|unknown
field|phenotype
geographic distribution|geographic distribution|secondary; see below
legacy|unknown
molecular data|genotype|I don't really care what the terminology is, but we should strive for consistency if possible. (Maybe nongenetic molecules are used in IDs??)
photograph|phenotype
published referral|published referral??|Ideally these would come from the nature of the publication, but that could require multiple values. I suppose as long as the list is short "phenotype; genotype" isn't THAT evil
revised taxonomy|revised taxonomy|secondary; see below
student|unknown|Maybe these are all "phenotype"??

ID of kin, geographic distribution, and revised taxonomy are "dependent" or secondary IDs; you need to see the mother's ID to know anything about how much you should trust the embryo's, a taxonomic revision is basically just a search term and the important data is in a previous identification, and geographic distribution is (from my perspective) pretty close to useless by itself, but I can definitely see how it adds value as a 'suggestion' on top of that 'Peter Pyle (=expert) made identification based on phenotype on a specific date' "primary" ID.

I'm still not crazy about the idea of identifications whose validity hinges on the accuracy of locality data.

I'm not sure how to model that (something about "accepted" getting more complex, probably) and I'm certainly not sure how to ship it to GBIF, but it feels like there's some _correct_ in there somewhere....

Some sort of 'accepted, but not all you need to know' "acceptedness" might also take care of the "these 12 techniques all say _Bla blah_" that might otherwise get lumped into "published referral."

Seems the tide is shifting against my interest in maintaining 'expert' as
an option. After some thought I realized I've only used that field for a
search on 'expert' once.

I wanted to see how many Identifications I had made as an 'expert' vs how
many I had made as a 'student'. I used this information in my promotion and
tenure file. There were lots in both categories but the ones as expert were
of course more valuable. It would be a terrible loss of information to not
be able to distinguish between these two types of identifications. I would
be unable to perform this search just using taxa I know well (and it would
take separate searches on all the different taxa....) because there are too
many to remember and it depends on how well written were the keys I was
using etc. I'm a beetle specialist but for some beetle groups I can't
perform expert IDs while for others I can. When I enter the information on
the ID into Arctos I assess my confidence in the ID at that point. It's
valuable data.

These terms are vague yes, but they are an attempt at categorizing
confidence in the reliability of the ID. ( I wouldn't want to have to rank
our confidence in an ID on a 10 point scale, that would be horrible).
Perhaps we need a new field for this, since it is different information
than HOW the ID was made.

Perhaps we need a field named "ID confidence" and all those marked with
method = expert now should get "ID confidence = high" and all those marked
with method = student (or curatorial) should get "ID confidence = not high"
.

Also, since we have the method of 'molecular' it's safe to assume that all
IDs that were made by 'expert' or 'student' could be replaced by
'phenotype' rather than 'unknown'. If someone wasn't using phenotype they'd
have specified what they were using. Phenotype is the default.

-Derek

On Tue, Jul 16, 2019 at 9:34 PM dustymc notifications@github.com wrote:

Thanks Carla - I like this direction, I just didn't know anyone else did!

From your comments, here's a first pass at mapping. If we go here we could
probably do better for many of the unknowns - eg, IDs by {person} or before
{date} or ... could get "upgraded" to something more specific.
old new Hu?
ID of kin ID of kin secondary; see below
ID to species group unknown
curatorial unknown
erroneous citation unknown This belongs in
https://arctos.database.museum/info/ctDocumentation.cfm?table=CTCITATION_TYPE_STATUS,
not sure what it's doing here too??
expert unknown
field phenotype
geographic distribution geographic distribution secondary; see below
legacy unknown
molecular data genotype I don't really care what the terminology is, but
we should strive for consistency if possible. (Maybe nongenetic molecules
are used in IDs??)
photograph phenotype
published referral published referral?? Ideally these would come from the
nature of the publication, but that could require multiple values. I
suppose as long as the list is short "phenotype; genotype" isn't THAT evil
revised taxonomy revised taxonomy secondary; see below
student unknown Maybe these are all "phenotype"??

ID of kin, geographic distribution, and revised taxonomy are "dependent"
or secondary IDs; you need to see the mother's ID to know anything about
how much you should trust the embryo's, a taxonomic revision is basically
just a search term and the important data is in a previous identification,
and geographic distribution is (from my perspective) pretty close to
useless by itself, but I can definitely see how it adds value as a
'suggestion' on top of that 'Peter Pyle (=expert) made identification based
on phenotype on a specific date' "primary" ID.

I'm still not crazy about the idea of identifications whose validity
hinges on the accuracy of locality data.

I'm not sure how to model that (something about "accepted" getting more
complex, probably) and I'm certainly not sure how to ship it to GBIF, but
it feels like there's some correct in there somewhere....

Some sort of 'accepted, but not all you need to know' "acceptedness" might
also take care of the "these 12 techniques all say Bla blah" that might
otherwise get lumped into "published referral."

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM56SXPES7PLLCL3XLDP72VNBA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2DCTCY#issuecomment-512108939,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM2C4GUVJF22NEQ54E3P72VNBANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

@DerekSikes I don't think I see a big-picture problem with that.

  • this specimen was identified using phenotype
  • the person doing the ID, at the time they were doing the ID, had {"experience factor"/confidence/whatever}

I just don't like confounding those concepts in one field. If I'm a geneticist looking for cryptic species, I probably only want things IDed using some sort of decent non-genetic technique, and our current data don't consistently support that. The ability to add "[not]confidently IDed" to that just seems like a bonus.

(I also don't much trust those data, but maybe that's another issue.)

And on that note, it probably takes us back to some sort of vocabulary/definition nightmare, but I do think there's some distinction somewhere between "phenotype, as in microscopes and slides and trait matrices and such" and "phenotype, as in it just looked like a badger." Maybe we do need to retain (and better define) "field" or something like it.

I'm not sure I understand this:

wouldn't want to have to rank our confidence in an ID on a 10 point scale, that would be horrible

anything using expert/student (and synonyms) is basically ranking confidence on a 2-point scale, no? I'm not sure how many points are in between "I just bought a field guide in the gift shop" and "I am describing the holotype" but it does seem like some sort of confidence scale.

No problem to adjust the migration path - I'll be asking lots of questions if we get to that point.

@DerekSikes why does it matter if you were an expert or a student - effort was involved either way. Is every ID I made less valuable because it was a student ID? It may have been an excellent addition to my knowledge base to make those IDs and some ID is better than "Animalia", correct? Never mind that you were the one deciding whether you were an expert or student? That seems like you could rig your own promotion...not that I'm accusing you of that, but just trying to make the point that "expert" and "non-expert" are subjective and one person's expert will be another person's non-expert. These kinds of criteria seem a poor way to determine a promotion and I am not in favor of Arctos becoming the arbiter of people's employment path.

I'm not sure I understand this:

wouldn't want to have to rank our confidence in an ID on a 10 point

scale, that would be horrible

anything using expert/student (and synonyms) is basically ranking
confidence on a 2-point scale, no?

Yes, a 2- point scale is reasonable. Even deciding how to rank these on a
3-point scale would be hard and hardly reproducible, even for the same
person at different times of the day! a 10 point scale would be nuts.

-Derek

On Wed, Jul 17, 2019 at 7:32 AM dustymc notifications@github.com wrote:

@DerekSikes https://github.com/DerekSikes I don't think I see a
big-picture problem with that.

  • this specimen was identified using phenotype
  • the person doing the ID, at the time they were doing the ID, had
    {"experience factor"/confidence/whatever}

I just don't like confounding those concepts in one field. If I'm a
geneticist looking for cryptic species, I probably only want things IDed
using some sort of decent non-genetic technique, and our current data don't
consistently support that. The ability to add "[not]confidently IDed" to
that just seems like a bonus.

(I also don't much trust those data, but maybe that's another issue.)

And on that note, it probably takes us back to some sort of
vocabulary/definition nightmare, but I do think there's some distinction
somewhere between "phenotype, as in microscopes and slides and trait
matrices and such" and "phenotype, as in it just looked like a badger."
Maybe we do need to retain (and better define) "field" or something like it.

I'm not sure I understand this:

wouldn't want to have to rank our confidence in an ID on a 10 point scale,
that would be horrible

anything using expert/student (and synonyms) is basically ranking
confidence on a 2-point scale, no? I'm not sure how many points are in
between "I just bought a field guide in the gift shop" and "I am describing
the holotype" but it does seem like some sort of confidence scale.

No problem to adjust the migration path - I'll be asking lots of questions
if we get to that point.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM5VXINS44AANLJRYEDP743Q3A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2EZFCY#issuecomment-512332427,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM67A2Q2FJV5SKP7HZDP743Q3ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I think we're also drastically under-using id_sensu (fkey-->publication); it could clarify some of the ambiguity in "expertness" (or serve as an independent measure of "expertness") for anyone who wants to dig, along with the usual purpose of clarifying the taxon concept.

Man, you're good at a lot of stuff!! :-)

Don't distract me, I'm still countin' legs.

BTW, I retract my comment about DWC - it seems like we are trying to fill identificationVerificationStatus with this. This standard is defined as : Comments or notes about the Identification. Pretty sure no one is going to know what most of our current terms mean and also pretty sure most people will know what the terms in Carla's list do!

2160

I underuse that because to use it properly I'd have to have all the keys I
use already loaded into Arctos publications to link to & this would be a
massive data entry project unto itself.

Love the idea, but it fails due to cost of getting content in to make the
idea work.

-D

On Wed, Jul 17, 2019 at 8:48 AM dustymc notifications@github.com wrote:

I think we're also drastically under-using id_sensu (fkey-->publication);
it could clarify some of the ambiguity in "expertness" (or serve as an
independent measure of "expertness") for anyone who wants to dig, along
with the usual purpose of clarifying the taxon concept.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYLLQAOCQQMZNQF5I3P75EMPA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2FAQ3Q#issuecomment-512362606,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM3YGQNELJGLKJ5SIJ3P75EMPANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

to use it properly I'd have to have all the keys I use already loaded into Arctos publications to link to & this would be a massive data entry project unto itself.

Maybe we can automate this a bit? See #2176

I reiterate that I think we can/should get to the 'expert' vs 'student' etc issue via agents. ID is made by a certain person on a certain date. The person could be a student at one point in time, a post-doc (more of an expert) at another time, and a curator (=expert) at a third time. Also, I would say that undergraduate student and graduate student have different levels of expertise, and that doesn't come in the 'student' value for ID.

We should separate the how from the who. This would involve expanding on agents and adding agent role over time to create history (now role is just in remarks or maybe address):
agent X1 = graduate student between dates A and B
agent X2 = postdoc between dates C and D
agent X3= curator between dates E and F

then apply one of those three agent values to the determination.

???

I would say that undergraduate student and graduate student have different levels of expertise

Still a matter of opinion and varies by the student involved. I really dislike adding our opinions. Facts are good - the position of the agent at the time of ID seems like fair information to pass along. Users can decide what that means (my grad student may be more of an "expert" than someone else's tenured faculty depending on the organism involved....).

If someone does something that does not suck with publications (or taxonomy, or agents, or...) it should be trivial to plug in, in any capacity from making something trivial better to just replacing our model. Not holding my breath...

If the publications have DOIs, it should be fairly easy to sprinkle the create form around. (I doubt these do??) If the authors submit ORCID with the publication and to Arctos I could magic the author part, which is about 90% of the current work in creating a publication. (Even less likely, I'm sure.)

Unrelatedish, if one publication here is too much work then maintaining a taxon concepts model does not seem remotely realistic. This is not the first time that idea has surfaced, or at least caused some ripples.

The intent of "student" has always been 'one willing to learn,' and the documentation has always supported that.

@ccicero I don't think this is something that can be normalized to people. If I bring you a North American bird, I'm pretty much just going to believe whatever you tell me. If I bring you a deepwater coral larvae - well, maybe not so much. (Nevermind, I think I figured it out below...)

I think the really critical thing here is separating technique. That's going to open up a whole new world of ways to query the data, let people get at really fundamental things - do genetics and morphology support the same hypotheses? - that just aren't accessible in our data currently.

If there's some value in ranking the confidence of the ID, I can't see how it detracts from anything. I don't think I'd ever use or believe it, but I also don't think it'd ever prevent me from doing anything so I'm fine with it.

Actually I might use and believe it.

And on that note, it probably takes us back to some sort of vocabulary/definition nightmare, but I do think there's some distinction somewhere between "phenotype, as in microscopes and slides and trait matrices and such" and "phenotype, as in it just looked like a badger." Maybe we do need to retain (and better define) "field" or something like it.

The "confidence score" could be a useful way to get at that - "Hi I'm me and despite whatever my agent-stuff might imply, you shouldn't trust me very much on this particular ID."

https://github.com/ArctosDB/arctos/issues/1873 might be the best way to organize your data - it would require a few more relationships ("graduate student of" etc.). Certainly nothing above prevents those agent-data-including queries. And including the taxa used in the IDs brings the agent's 'expertness' into perspective....and AHA!, maybe, I think.

So Carla+undergrad+Pipilo finds birds IDed between certain dates (defined in agent-data) by a certain agent - essentially a dynamic "trust factor."

So Carla+postdoc+Pipilo [+phenology + whatever else is relevant] finds birds a different set of birds (unless there are re-ids) under a different "trust factor" (still dynamically defined by the user at that moment).

So if we add one more field ("confidence score"?? Need a name) and a categorical code table (with 2 categories to start with) to feed it, I think everybody's happy and this is just a matter of vocabulary and eventually UI to hook deeper into agents.

Yay us?!?

I think we are converging on something. I agree @Jegelewicz about sticking to facts. By adding a role explicitly to agents (undergraduate student, graduate student, postdoc, curator, etc.), it gets at the status of a person during a certain tenure without saying anything about confidence in that person.

I'm not sure I like 'confidence score' - that seems pretty subjective. Leave that to the person using the data. Cicero+Pipilo+audio (audio, video should be other categories of 'how') would have fairly high confidence, but the person using the data could/should interpret based on knowing something about my research rather than us providing that info.

--> I think the really critical thing here is separating technique. That's going to open up a whole new world of ways to query the data, let people get at really fundamental things - do genetics and morphology support the same hypotheses? - that just aren't accessible in our data currently.

This is what I'm getting at. We should work on coming up with a vocabulary for technique, and that should be separate from information about the person and his/her role/status/capacity. We'd need to be able to apply >1 technique to an ID.

DNA sequence
genomics
morphology
coloration
microscopy
audio
video
etc.

By adding a role explicitly to agents

Minor point: adding a role to RELATIONSHIPS (which are already scheduled to get a date-component). That'll also let me get at WHERE you're a student-or-whatever, so I can add a little extra statistical weight to graduates of Dusty's Institoot o hi-er learnin'...

'confidence score' - that seems pretty subjective.

Yes, I believe that's the point.

(audio, video should be other categories of 'how'

I'm not quite sure that's correct - it's more "behavior" or perhaps "song" - like 'photo' it's the information, not the media, that we care about - but yes, I noticed that gap as well.

So "Cicero+Pipilo+audio" is the "baseline possible."

You can ignore the 'confidence score' (when searching and creating IDs) for most stuff.

If you're trying to ID a bird off of a damaged tape and you just can't quite hear enough to be sure or something, you can use 'confidence score' as a searchable way to flag that particular ID. (And I can use your agent info to not only get at how much I trust your IDs, but also how much I trust your evaluation of your IDs!)

Minimally, I don't think it can detract from anything else, and it seems that some of us have a use case for it. Ideally, perhaps it does add some real less-subjective value in some specific situations.

We'd need to be able to apply >1 technique to an ID.

That's probably a simpler approach than my vague ideas regarding "secondary" or "dependent" IDs. (And the "cool toys" comment in https://github.com/ArctosDB/internal/issues/27#issuecomment-512071298 might make it trivial without mucking with the structure - not 100% sure of that.)

Would this be one term/list, or perhaps a general+specific split - eg,

morphology; coloration; microscopy

or

general: morphology
specific: coloration; microscopy

??

I'm very much in favor of a field for 'confidence score.'

In entomology we make a big deal about identifications. I try to train my
students to work within their skill set - don't try to ID something to
species until you've shown that you can reliably ID at higher levels first
(orders & families).

A 2-category confidence score gives one a way of saying "high confidence =
my reputation is on the line, if I got this wrong I'd be embarrassed and
suffer reputational damage. Thus I will use this only when I'm really
confident."

"not-high confidence = I did my best but I'm not an expert on this group
and if I got it wrong at least I warned you I wasn't that confident in the
ID. No reputational damage suffered."

Granted, for older collections where most of the databasing is being done
of specimens ID'd in the past & not by the curators/CM staff themselves,
this is harder to use if you don't know the people and their reputations
for expertise with certain taxa. However, in UAM entomology we make 40+
loans a year of tens of thousands of specimens and we loan them to people
doing taxonomic work on the groups who know these taxa better or as well as
anyone else alive. When these specimens come back from loan I mark their
IDs as 'expert' - this is useful because in 100 years folks might have a
hard time working out who was an expert on which taxa when... etc.

So I'm in favor of 'confidence score' with 2 categories. Deciding on the
names for these 2 categories is tricky though. I like 'expert' and
'non-expert' which is more obvious and means different things than 'high'
vs 'not-high' because people would interpret 'not-high' as synonymous with
'low-confidence' even though that's not a safe assumption, it'll still
happen. Or we could use numbers 1 vs 0 (but I prefer 'expert' and
'non-expert'

Also, I reiterate that all IDs by experts and students would be method =
phenotype since that's the default method, if molecular or other methods
had been used, they'd have been specified.

-Derek

On Thu, Jul 18, 2019 at 8:08 AM dustymc notifications@github.com wrote:

By adding a role explicitly to agents

Minor point: adding a role to RELATIONSHIPS (which are already scheduled
to get a date-component). That'll also let me get at WHERE you're a
student-or-whatever, so I can add a little extra statistical weight to
graduates of Dusty's Institoot o hi-er learnin'...

'confidence score' - that seems pretty subjective.

Yes, I believe that's the point.

(audio, video should be other categories of 'how'

I'm not quite sure that's correct - it's more "behavior" or perhaps "song"

  • like 'photo' it's the information, not the media, that we care about -
    but yes, I noticed that gap as well.

So "Cicero+Pipilo+audio" is the "baseline possible."

You can ignore the 'confidence score' (when searching and creating IDs)
for most stuff.

If you're trying to ID a bird off of a damaged tape and you just can't
quite hear enough to be sure or something, you can use 'confidence score'
as a searchable way to flag that particular ID. (And I can use your agent
info to not only get at how much I trust your IDs, but also how much I
trust your evaluation of your IDs!)

Minimally, I don't think it can detract from anything else, and it seems
that some of us have a use case for it. Ideally, perhaps it does add some
real less-subjective value in some specific situations.

We'd need to be able to apply >1 technique to an ID.

That's probably a simpler approach than my vague ideas regarding
"secondary" or "dependent" IDs. (And the "cool toys" comment in ArctosDB/internal#27
(comment)
https://github.com/ArctosDB/internal/issues/27#issuecomment-512071298
might make it trivial without mucking with the structure - not 100% sure of
that.)

Would this be one term/list, or perhaps a general+specific split - eg,

morphology; coloration; microscopy

or

general: morphology
specific: coloration; microscopy

??

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM7R6NT4ZEMCBGIOBCTQACIPNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2I7NKI#issuecomment-512882345,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM6BXZPARIGLYXZ63BTQACIPNANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Confidence score seems different than 'expert' versus 'non-expert' - even an expert can have a less then perfect confidence in an ID. What about something like 'certain' and 'uncertain' or something like that. Also again, 'expert' and 'non-expert' is really related to the person = agent.

---> Would this be one term/list, or perhaps a general+specific split

I'd say a list of terms, apply all that are relevant and then concatenate into a single field?

I like 'certain' and 'uncertain' - I'm no expert, but I'm also fairly sure I've never misidentified a moose or walrus.... That seems to jive with Derek's use case as well.

concatenate into a single field

I think we can make that work, and if we go to https://github.com/ArctosDB/internal/issues/27#issuecomment-512071298 we can treat it as a data object (array) instead of a concatenation. (We can do that in Oracle too, but it involves creating a datatype/it's a little more complicated.)

I see no obstacles to implementation at this point, other than vocabulary.

I suggest "nature of ID" (I think we should retain that vocabulary??) remain NOT NULL with an "unknown" (=we're not ignoring this because we can, we don't know) option, and 'confidence' (??) be NULL; most of us are just not going to have that information much of the time.

@DerekSikes for migration purposes I can be as specific as the data allow - "all IDs by experts and students would be method = phenotype" isn't a problem as long as you can somehow identify "experts and students," and we can take another swing if we miss something. Going forward there would be no defaults - how you use this (and if you use it, in the case of NULLable fields) would be left to the collections.

SO - vocabulary time?

Here's Carla's list + the current vocabulary I think we need to retain and some comments.

  • DNA sequence - how is this not "genomics"?
  • genomics
  • morphology - "gross morphology" to distinguish from microscopy?
  • coloration
  • microscopy - I think this is somehow both too broad (karyotype should probably be it's own thing, if we still have any of that) and too narrow (eg, you could look at insect genetalia through a microscope or CT scanner). Maybe close enough for now??
  • audio - "call"???
  • video - this is tricky - probably includes calls, "gross morphology," maybe habitat, .... - IDK, maybe "video" encompasses all of that and is just right?? (Keep "audio" too for the same reasons???)
  • etc. - this will be a code table, we can add to it at any time, I don't think we need to anticipate everything, just get everything that we need immediately.
  • ID of kin
  • geographic distribution
  • unknown (nĆ©e legacy)
  • published referral (I'm still - perhaps again - not sure I understand this - and 'type specimen' - both of which can be derived from citations attached to the identification.)

Yes, this is a qualification on the identification itself, not on the
agent. As I've said, I'm an expert only on some taxa, not all, thus I use
'expert' ID only for those for which I claim expertise.

I've not argued we should be using this term to qualify the agents. I think
that's a rabbit hole of little value. These data (expert vs student) for
Identifications are just ways to state how reliable the IDs are.

If it were a field for the agent it would be in the agent table.

-D

On Thu, Jul 18, 2019 at 3:03 PM Carla Cicero notifications@github.com
wrote:

Confidence score seems different than 'expert' versus 'non-expert' - even
an expert can have a less then perfect confidence in an ID. What about
something like 'certain' and 'uncertain' or something like that. Also
again, 'expert' and 'non-expert' is really related to the person = agent.

---> Would this be one term/list, or perhaps a general+specific split

I'd say a list of terms, apply all that are relevant and then concatenate
into a single field?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM5DBS4A3N5Q4M5DZDDQADZDXA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2KCKHA#issuecomment-513025308,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM2AHTX6TJXECBNYHT3QADZDXANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

It would be helpful to create a google doc for vocabulary and definitions to aid in migration/documentation? Can we do that please?

google doc

On it....

I don't like 'certain' vs 'uncertain' because certain means 100% whereas
'high confidence' means > ~90%

-D

On Thu, Jul 18, 2019 at 3:54 PM dustymc notifications@github.com wrote:

google doc

On it....

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM546XWNNRJGJICROKDQAD7B3A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2KFDDI#issuecomment-513036685,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM2WKUUGQMGLDW74DVLQAD7B3ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899

There are two tabs - 'migration path' will remain very waffly until 'ctnature_of_id' is solidified, and will only come into play after we've dealt with whatever needs dealt with by itself (Derek's "all IDs by experts and students would be method = phenotype" and similar).

Are we agreed on "confidence" for the new field?

I'm fine with "[high|low] confidence" for the terminology, but I'm also fine with absolutely anything else that gets the idea across. Like all Arctos vocabulary, the term means precisely what we define it to mean, and using it in any way outside the definition we assign is objectively wrong.

re: "I'm fine with "[high|low] confidence" for the terminology, but I'm
also fine with absolutely anything else that gets the idea across. Like all
Arctos vocabulary, the term means precisely what we define it to mean, and
using it in any way outside the definition we assign is objectively wrong."

Few users, even power users, know all the definitions (or bother to look
them up) & humans are error making machines so the more intuitive and
simple we make the terms the fewer errors will be introduced.

We could for example define "high confidence" as "confidence is low" and
then not be surprised when all the data are a mixture of confused errors.
Let's not rely on the definitions.

-Derek

On Thu, Jul 18, 2019 at 4:18 PM dustymc notifications@github.com wrote:

>

https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899

There are two tabs - 'migration path' will remain very waffly until
'ctnature_of_id' is solidified, and will only come into play after we've
dealt with whatever needs dealt with by itself (Derek's "all IDs by experts
and students would be method = phenotype" and similar).

Are we agreed on "confidence" for the new field?

I'm fine with "[high|low] confidence" for the terminology, but I'm also
fine with absolutely anything else that gets the idea across. Like all
Arctos vocabulary, the term means precisely what we define it to mean, and
using it in any way outside the definition we assign is objectively wrong.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMZVBDELJH4P5PGCPY3QAEB4PA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2KGICA#issuecomment-513041416,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM3SWW7FK2TEEYZQTHTQAEB4PANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Obviously the terms should be as intuitive as possible, and no doubt the further we drift from that the more they're misused and data quality (eg, ability to talk to other things or answer deep questions) drops off. Avoiding confusing terminology is occasionally a decent reason to avoid all vocabulary and do something symbolic - things like 1-5 star reviews are built on this idea.

This will be applied to walrus (pretty hard to get wrong) and fossil pollen (magic, as far as I can tell) and ... - whatever we do should make some sense for anything that might end up in Arctos. I think I'm hearing that vague is better, and we should avoid any terms with numeric connotations.

"confident" and "less-confident" (and "SWAG"....)??

A sliding scale between 0 (this might not be a bird at all) and 100 (it can't possibly be anything else)??

?????

--> Are we agreed on "confidence" for the new field?

Fine with me.

---> I'm fine with "[high|low] confidence" for the terminology, but I'm also fine with absolutely anything else that gets the idea across.

High/low confidence is ok with me.

I just went through the google doc and made some edits to the nature of ID terms, and added a column with comments on what I did and reasons. I like it! Looking through this, I think it makes sense to just have one ID basis per ID; different bases for ID can be applied through ID history. I have not done anything yet with the migration path.

This brings up a related issue (can't find it but I think it already exists) for how to deal with multiple IDs in data entry - both individual form and bulkloader. When a specimen comes back from the field or is prepped as salvage, it's often ID'd using 'gross morphology' (previously 'field') but then another ID (geographic distribution ---> subspecies) is added when it's cataloged. We need a way to be able to enter both of these IDs and their respective 'nature of ID' at the time of cataloging.

I did find this old issue but that's not the one I'm thinking of. I'm pretty sure that I created an issue for it. Create a new issue?

Agree with high low confidence. After all, IDs change, so "certain" is not
appropriate.

On Fri, Jul 19, 2019 at 7:32 AM Carla Cicero notifications@github.com
wrote:

--> Are we agreed on "confidence" for the new field?

Fine with me.

---> I'm fine with "[high|low] confidence" for the terminology, but I'm
also fine with absolutely anything else that gets the idea across.

High/low confidence is ok with me.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBAXRNH6ZITTA4TLKFTQAG7AHA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2LUJKI#issuecomment-513229993,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBATAFX6JMZZQUCHA4DQAG7AHANCNFSM4IEIGLOA
.

@ccicero thanks! Definitions look great/I'm OK with all of that.

That issue is as good as any - it's how I'll get there. PLEASE prioritize it, we can merge if anyone ever finds the other.....

I'd like to make the case again for 'high' vs 'not-high' confidence and
here's why:

high = > ~90% confidence
not high = < ~ 90% confidence

low = < ~50 % confidence

not high and low mean different things. Low leaves a bunch of 'not high'
(medium?) confidence unavailable as an option.

-Derek

On Fri, Jul 19, 2019 at 6:17 AM dustymc notifications@github.com wrote:

@ccicero https://github.com/ccicero thanks! Definitions look great/I'm
OK with all of that.

That issue is as good as any - it's how I'll get there. PLEASE prioritize
it, we can merge if anyone ever finds the other.....

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMZZMFAQBJZPF7THGADQAHEF7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2LYB4Q#issuecomment-513245426,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM4YMZIHZX4QV2WPLITQAHEF7ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

@DerekSikes OK, but I don't like 'not high' as a term. What about this:

term = identification_confidence

values:
confident: Identification is made with 90% or higher confidence.
not confident: Identification is made with less than 90% confidence.

???

or

more confident: Identification is made with 90% or higher confidence.
less confident: Identification is made with less than 90% confidence.

because 89% is still fairly confident, just not quite as good as 90%.

???

I think I'm digging the slider idea more and more...

If I found that 10% (or ANY percent really) of the things I'd thought were walrus were in fact not, I'd just be flabbergasted. Assigning a 90% (or 99%) confidence to that is a HUGE degradation of the data; it just doesn't remotely represent the situation, which is that there are a limited number of possible taxa that present as 5000 pound carcasses with a distinctive shape, texture, smell, and dentition on the beaches of western Alaska. Identifying walrus from ancient ivory carvings is probably a bit trickier.

On overwintered shrews from just the skull, I might start buying lottery tickets if I found out I got 90% of them right, and I don't think that's a drastically different number for "experts" - the diagnostic features just aren't there. Geneticists could certainly do much better - their "diagnostic features" don't get ground away.

I assume that sort of variability is fairly common across taxa/technique/individual, and so should be available in our evaluation of ourselves.

A slider would facilitate the full range of confidence, and also get us out of this vocabulary trap. It's an intuitive approach to this sort of data, and can be implemented through a familiar UI.

UI would look something like https://arctos.database.museum/demo - that's just the one-line, out-of-the-box implementation, it can be styled in any way.

Our collection may be an outlier, but we find the distinction between "legacy," "student," and "expert" to be of value and in Dusty's table, all of them become "unknown." For us, a legacy ID (40% of our collection) is whatever arrived with the specimen. (I know that's not the exact definition in the table, but it's the closest we had to chose from.) Well over half of our lots arrive with an ID from a collector or dealer. We may have no idea what the collector's confidence level was or what the basis for the ID was, but if we haven't changed it, it is an accepted taxon name. In the ID remarks, we note if the ID has been confirmed and by whom.

A student ID (36% of our collection) is made by any and all of our volunteers as none of us is a qualified expert, but we do research the specimen and have a modest degree of confidence. Only 2% of our specimens have been reviewed by a true expert.

To me, these are important distinctions and should be to a researcher as well. I hate to have the nature of the IDs of >75% of our collection become "unknown" when we do know something about the nature, timing and quality of the ID. Are we trying to replace this information with the two options in the "degree of confidence" field? That still doesn't convey the degree of knowledge of the person making the ID (if known).

Before this is implemented, can we see a visual of how the data entry "Identification" section would look and the options in the controlled fields?

whatever arrived with the specimen

That was the intent of legacy, and now unknown - "this is what we have."

no idea what the collector's confidence level

I think this must be a NULLable concept for precisely that reason.

In the ID remarks, we note if the ID has been confirmed and by whom.

I'm sure that's common, but it's also of extremely limited utility - it's useful for someone who's already found the specimen and is willing to read the remarks. Adding an identification makes that information much more usable.

75% of our collection become "unknown"

That's just a default. If you make generally make identifications by "gross phenotype" (or whatever) then we can use that as the default for your collection, or bits-and-pieces of it, or whatever.

Are we trying to replace this information with.... confidence

Yes, but it offers a lot more precision when that's available.

two options

Agreed, more or less - that's why I'm leaning towards something like a slider.

visual

This...

Screen Shot 2019-07-19 at 8 57 30 AM

would use terminology from https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899, and something like

https://arctos.database.museum/demo

Screen Shot 2019-07-19 at 8 58 16 AM

or some dropdown of some terms or something would be inserted near it.

Missing from the last reply: Once we've established some agreeable means of recording identification_confidence, we can talk about "translating" expert, student, etc. to it. This should not be a lossy process.

Should the title of this field go from Nature of ID to ID Methodology?

This does still leave out what we're eliminating from this field - the nature of the person or situation in which the ID is made: student, expert, legacy, temporary, field. Can this be captured here:

we can talk about "translating" expert, student, etc. to it. This should not be a lossy process.

And thanks for the new word for the day: lossy!

Is the Confidence-o-Meter optional?

Nature of ID to ID Methodology?

I don't think so - this better encompasses the "basic or inherent features" of the identification - eg, someone looked at morphology. Methodology can come from id_sensu, and I'm not sure there's a lighter-weight proxy to that.

the nature of the person

That's in their agent record - eg, review Carla's publications, projects, collections, etc., and you might start to get the idea that she knows something about birds. Adding dates to relationships will eventually let you get a bit deeper in that - eg, maybe Carla's IDs made when she was an undergrad should be weighted differently.

It's also not something natural to Carla, as far as I know - I'm guessing she'd not be quite as "expert" at identifying liverworts or larval jellyfish (but who knows!) - it's just part of the context needed to evaluate a specific identification.

student

This has always been ambiguous, but some sort of lower confidence than expert should get you to one of the things it might have been intended to cover.

expert

As above.

legacy

That's just another word for "unknown" in this context.

temporary

I don't understand why this might ever exist.

field

gross phenotype I think, but I've never quite understood how this was distinct from all of our other vague terms, so...

Confidence-o-Meter

Nice!

optional

yes

I think this must be a NULLable concept for precisely that reason.

I like the idea of a slide-o-meter for confidence level (which would be nullable), rather then arbitrary and more subjective categorical divisions.

I think "Nature of ID" is descriptive - to me that describes what an ID is based on, but we've been confounding it with who it was based on (expert, student) which is more appropriate in the context of agents as Dusty described.

legacy - probably 'gross phenotype' is better than unknown (?) but good question on whether that term makes sense for cultural and paleo collections. Rather than 'gross phenotype,' what about 'gross traits' to make it more broadly applicable? A basket with specific types of plant or animal matter can be described by traits, as can a fossil bone, but phenotype doesn't seem appropriate in those cases.

'gross features' is better than 'gross traits' - I updated the google doc accordingly, see definition.

gross features

I like it. @AJLinn

I don't much like 'related kin'==>kin kin/related relatives.

I'm wondering if we should also push 'genomics' in the direction of something like 'chemical analysis' (not crazy about that term) to encompass things like https://physicsworld.com/a/nuclear-fallout-used-to-spot-fake-art/, or if that (when/if we get those data) should be another category?

@marecaguthrie

What about "trait- based" instead of " gross traits". Sometimes traits are
gross, but not always :)

We need "kin relationship" for things like captive bred wolves etc. None of
us are confirming traits when the source is a captive breeding population.

I don't think we need to go into the details of the genetic vs genomic IDs.
If we go that route, we'd have to distinguish single locus vs multilocus
(and how many loci?) nuclear vs mitochondrial or some combo thereof with
or without added morphological traits vs whole genome but which parts of
the genome etc etc. How about just "molecular"?

On Sat, Jul 20, 2019, 9:27 AM dustymc notifications@github.com wrote:

gross features

I like it. @AJLinn https://github.com/AJLinn

I don't much like 'related kin'==>kin kin/related relatives.

I'm wondering if we should also push 'genomics' in the direction of
something like 'chemical analysis' (not crazy about that term) to encompass
things like
https://physicsworld.com/a/nuclear-fallout-used-to-spot-fake-art/, or if
that (when/if we get those data) should be another category?

@marecaguthrie https://github.com/marecaguthrie

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBBDQ4J3IU35MWUY6ALQAMVHDA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2NQQIQ#issuecomment-513476642,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBATHWOKWAFDCAR6573QAMVHDANCNFSM4IEIGLOA
.

The attempted distinction is between "to the eye, it looks like a ..." and "we stuffed it under a microscope/CT scanner/etc., and there it looks like a ...."

The concept of "ID of kin" is safe, I just don't like the redundant wording - kin and relatives means about the same thing.

If single locus vs multilocus is something that users might want to search on, I have no problem adding it (now or later). If not, I think we should be as generic as possible. (I suspect 'multilocus' can only come usefully from id_sensu - a publication describing the details - and general terms are what's needed here, but ???) "Molecular" seems reasonable to me.

Sorry to be out of this - I muted the thread thinking it wasn't really relevant to us as it's a field we often don't spend a lot of time thinking about, but in looking thru the google doc, I'm not seeing a lot of usefulness for cultural collections.

gross features

That term doesn't have previous meaning for cultural collections... but it's infinitely better than 'gross phenotype', which I have no idea what it means! 🄓I think we can make it work. I don't know of a better way of indicating how the ID was made that would work for all.

As Carla referenced, cultural collections people can partially ID items based on materials, but also design elements, overall form, or overall perceived function. But we make ID judgements about a variety of things (materials, culture of origin/use, copyright applicability, or even maker at times if it's not signed but looks exactly like another piece in the collection).

For cultural collections the ID field is referencing the name of the item (is it an arrow head or a spear head, is it a basket or a hat, is it a meat platter or a cutting board, a tea pot or coffee pot, a man's parka or woman's parka), which is usually directly tied to it's function. It can be complicated with objects are re-used (e.g., a leg trap spring is reused as a scraper blade or a file is turned into a knife). We get at that complexity in our attribute "description."

Most of our IDs right now are "legacy". New nature of ID labels are either curatorial (me) or student (my students doing data entry). If we have an elder come thru and correct an ID, we change it to "expert". We have lots of accidental "ID of kin" because of its default position - but I also sometimes use it intentionally with the thought: "this object looks just like another item that is IDed as a fish spear prong, so it must be a fish spear prong".

I actually wouldn't mind having a slide-o-meter for some of those other areas (attributes mostly) where we're making "educated guesses" because I'm often surprised by how clearly not-correct some of the IDs are.

I think for archaeology collections (and likely paleo as well), this is even more difficult and more important because they're looking at fragments of things. Archaeologists at our museum often come look at full-sized modern pieces in the ethnology & history collection to decide if a fragment is part of a tobacco or coffee can, or a birch bark basket rim vs. birch bark canoe top plate.

Sorry to be jumping in late and missing most of the discussion. I'm not sure how to make the nature of ID more applicable for cultural collections given these complications, but I hope you consider us in your changes. Most of the terms in the google sheet right now are not very helpful for us honestly.

---> I don't like 'trait-based' for the reasons Dusty gave. A fine count of scales or a wing measurement is also trait based, but that is different from an overall qualitative assessment based on what something looks like.

---> I've never really liked 'ID of kin' - sounds awkward to me. I do prefer 'kin relationship' even though there is slight redundancy, because I think it's clearer.

---> @AJLinn I added 'function' as a term to address your points, see definition in google doc to see if that makes sense.

Also, to address what Angela was saying about identifying cultural items based on others that look the same, I changed 'ID of kin' / 'kin relationship' to just 'relationship' so that it can apply more broadly to both cultural and natural history collections. This also makes the most sense for things like nests. Here is the definition I put in the google doc:

Identification based on the identification of a related individual (e.g., parent, sibling), a maker (e.g., for objects such as nests or cultural items), or of another object that has the same function and/or features (e.g., for cultural items).

Should "fine morphology" become "fine features" (detailed features? something features...) to better contrast with "gross features"?

Maybe "general features" and "detailed features"? Qualitative features/quantitative features??

"It's obviously a teapot" is definitely what we're trying to get at with "gross features" (whatever the label ends up being).

"There are signs of percussive flaking under microscope/xray/etc., therefore it must be a Whatever Point" is "fine morphology" (again, whatever we're calling it).

"We found _Camellia sinensis_ residue on this potsherd, so it was probably a teapot" is "genomics" - the concept, we'd probably need a new label if we get those data.

Note also that the "trust factor" built into Agents doesn't live in isolation - eg if a student identifies a bunch of things, then an "expert" (eg, an agent with a bunch of relevant publications) comes through and reidentifies them, you can use the correlation between those IDs to figure out that the student is generally right (or wrong...) and use that to weight their other IDs. That's also yet another reason to create IDs when you have them instead of eg, stuffing "confirmed by {cryptic initials}" in some remarks field or something.

I like it!

I had the same thought and like the contrast between 'fine' and 'gross' features - see modified description. Fine-scale analysis may be qualitative (e.g., shape of hummingbird feathers) so that contrast doesn't work. I modified the description to reflect that.

genomics vs molecular: I think molecular is better. DNA sequence of cytochrome b isn't genomic, but it is molecular.

minor detail - should we be consistent in adjective versus noun. e.g., 'molecules' vs 'molecular' ??? I think that would look better so changed made that change in the doc.

btw, I increased the priority of this related issue to high.

I don't like "molecules" - it may be semantically consistent, but it is not
in any usage in the sense of "molecular" genetics. I would advocate for
using terminology consistent with usage in the literature and in the rest
of the research and collections community.

On Thu, Jul 18, 2019 at 6:18 PM dustymc notifications@github.com wrote:

>

https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899

There are two tabs - 'migration path' will remain very waffly until
'ctnature_of_id' is solidified, and will only come into play after we've
dealt with whatever needs dealt with by itself (Derek's "all IDs by experts
and students would be method = phenotype" and similar).

Are we agreed on "confidence" for the new field?

I'm fine with "[high|low] confidence" for the terminology, but I'm also
fine with absolutely anything else that gets the idea across. Like all
Arctos vocabulary, the term means precisely what we define it to mean, and
using it in any way outside the definition we assign is objectively wrong.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBC742V344SLGOZERRLQAEB4PA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2KGICA#issuecomment-513041416,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBBHFRDTK3M3Q5Q7343QAEB4PANCNFSM4IEIGLOA
.

ok, changed it back.

I made a few more comments on the google doc. Please review.

On Sat, Jul 20, 2019 at 2:47 PM Carla Cicero notifications@github.com
wrote:

ok, changed it back.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBDISXKIKNAVDEPYJUDQAN2VTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2NVXUI#issuecomment-513498065,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBHGCZQGSVR2BSDK5FDQAN2VTANCNFSM4IEIGLOA
.

I'm not sure who changed it back to genomics, but I don't think that's accurate. What about just 'DNA' ?

'coarse features' - ok with me, nice (better) contrast with 'fine features'

'audio-visual' - yes, that's better!

changed it back to genomics

Edit history says I did, but certainly not intentionally. I changed it to DNA from CC's suggestion, I'm not attached to that (or anything else, except the idea).

I'm not 100% attached to the idea, but I think there's a distinction between audio where you have only one kind of information and video where you potentially have much more. I suppose we can sort it out when we get some data...

I think for the purposes of ID, audio-visual is sufficient. 'audio' could also include some behavioral aspect (observe a bird singing from the top of a juniper bush, and it sounds like an Oak Titmouse). Other information might go into 'habitat' or 'attributes' of some kind or another (and we could probably expand on attributes for behavioral data). As you said, can be sorted out later.

This brings up another point for the purposes of migration: basis of record will be important, because how data are migrated will depend on context (observation, preserved specimen, cultural item, etc.). For example, I would say that all bird observations with a 'field' nature of ID and an associated media would be changed to "audio-visual' - we should modify the migration tab to incorporate this somehow.

I like where this is going and moving away from subjective expert vs. non-expert descriptors. I am ok with adding a slider but I propose we style it in a way that doesn't give false-precision to the confidence value (is 63% confidence significantly more meaningful than 65%? How do you determine this anyway?). Perhaps there can be intervals - enough so that we aren't degrading the data but not so many so as to drive ourselves nuts fretting over the minutia (e.g. 100%, >90%, >80% 70-50%,,<50% .... or something). When we have an expert redetermine or confirm ID for loans, do we unrealistically ask them to rate their confidence? When they don't include their confidence ratings do we leave this field null despite the fact we know the specimens have been examined by a specialist? Or do we introduce some level of educated interepretation (>80%?)? If so, I think ranges vs. straight integers allows us to accurately convey confidence without getting overly precise.

As long as I can leave the rating slider null (not zero, NULL), I don't care, because I am not going to make any judgement calls about the relative expertise of anyone's ID (nor do I want to have the person who made the ID rate themselves). The move to objective means of identification is awesome an I totally support it.

I think much of that's documentation??

I can round to any precision....

UAM@ARCTOS> select round(45.9995476788,-1) from dual;

ROUND(45.9995476788,-1)
-----------------------
             50

but that might be more misleading/confusing.

At some point the ID (and probably specimen) bulkloader will likely need to accept a NUMBER and downloads will contain a NUMBER as well; we have limited control over UI.

The documentation should stress that this is a subjective thing which should be set to "that feels about right."

Any query access will need carefully and similarly documented; most of these data will be NULL, and so setting this to any value will exclude most specimens. I don't think it will be useful for "normal" specimen searches, it will be used by "us" to find the situations we created by using the tool.

I'm inclined to just accept whatever I'm given (within some defined range), mostly because I think it's least confusing. I suppose if someone really wants to set up some distinction between 89.00000345 and 89.00000346 they can, all we can do it make the documentation clear....

I changed back to "molecular" because this can include proteins,
immunology, RNA, future tech we don't know about yet . . . Otherwise we
have to specify each one . . .

On Sat, Jul 20, 2019 at 6:43 PM dustymc notifications@github.com wrote:

changed it back to genomics

Edit history says I did, but certainly not intentionally. I changed it to
DNA from CC's suggestion, I'm not attached to that (or anything else,
except the idea).

I'm not 100% attached to the idea, but I think there's a distinction
between audio where you have only one kind of information and video where
you potentially have much more. I suppose we can sort it out when we get
some data...

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBFN4ZGO7K7U5PPROMTQAOWK7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2NYRNA#issuecomment-513509556,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBHZUK67FSDBLFTFKIDQAOWK7ANCNFSM4IEIGLOA
.

I'm not a fan of the slider concept. How are students going to use this in
data entry? How do we assign in the field on a rapid assembly line? What
does it mean, anyway? It seems most useful for specimens that have been
subsequently examined curatorially to indicate if something has indeed been
examined by an expert on that particular taxonomic group. Everything else
by anyone else is "not enough info to assess confidence".

On Mon, Jul 22, 2019 at 10:38 AM Mariel Campbell campbell@carachupa.org
wrote:

I changed back to "molecular" because this can include proteins,
immunology, RNA, future tech we don't know about yet . . . Otherwise we
have to specify each one . . .

On Sat, Jul 20, 2019 at 6:43 PM dustymc notifications@github.com wrote:

changed it back to genomics

Edit history says I did, but certainly not intentionally. I changed it to
DNA from CC's suggestion, I'm not attached to that (or anything else,
except the idea).

I'm not 100% attached to the idea, but I think there's a distinction
between audio where you have only one kind of information and video where
you potentially have much more. I suppose we can sort it out when we get
some data...

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBFN4ZGO7K7U5PPROMTQAOWK7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2NYRNA#issuecomment-513509556,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBHZUK67FSDBLFTFKIDQAOWK7ANCNFSM4IEIGLOA
.

include proteins, immunology, RNA, future tech we don't know about yet. . . Otherwise we have to specify each one . . .

This is a question for users, but perhaps specifying each one is important - eg, species hypotheses built on "future tech we don't know about yet" [do|do not] align with species hypotheses built on {something else} and users will want to find those specimens and request loans.

Or we could see this as a not-so-precise category and get at "future tech we don't know about yet" via id_sensu (publications).

I think one category, whatever we call it, is probably OK for now, just something to keep in mind...

I'm not a fan of the slider concept.

So don't use it.

What does it mean, anyway?

By definition, probably something like 'subjective measure of how confident the identifier is in the identification." By usage, whatever Curators want it to mean...

It provides a pathway to get at the things Derek mentioned without engaging in an endless vocabulary debate, and it does it without requiring anything from anyone who doesn't want to use it or taking anything of value from identifications which haven't specified a 'confidence.'

"not enough info to assess confidence"

That's NULL, which would be the default condition.

I imagine using NULL in the majority of cases for our collection so I am not all out opposed to the slider. That said, I wonder if it is more straightforward to have some sort of toggle, e.g., "ID tentative" rather than attempting to quantify the degree of uncertainty for an identification in an unmethodical way across multiple collection types and personal assertion styles (and essentially reintroducing subjectivity, e.g., 80 vs 85% cetain). A flagged record could still be elaboarated upon with a textual description (or alas, even a numeric rating) in the ID_remarks if desired and is easily searched. It just seems like if we create some sort of Confidence-o-Meter and then find a better structure down the line, we then have to migrate all of these [overly] precise values into the new model that hold varying degrees of meaningfulness and which will likely be interpreted as [overly] significant for future users. For me, just knowing an ID is uncertain is valuable - regardless of whether the determiner was 80% or 45% confident, I know that specimens needs further examination and should likely be excluded from niche models, etc.

I prefer Emily's alternative to the slider.

As for:
get at "future tech we don't know about yet" via id_sensu (publications)

Yes, we just need to rely on liking to the publication. It is hard enough
for curatorial staff to just find and link publications. To add a
requirement that we read through the entire methods section of each pub to
figure out exactly what technique they used and translate that into some
Arctos terminology would not be reasonable. It is hard enough for me to
find those studies just based on "genetic" or "molecular" vs "morphology".
Parsing it further would require an enormous amount of time and effort.

On Mon, Jul 22, 2019 at 11:03 AM Emily Braker notifications@github.com
wrote:

I imagine using NULL in the majority of cases for our collection so I am
not all out opposed to the slider. That said, I wonder if it is more
straightforward to have some sort of toggle, e.g., "ID tentative" rather
than attempting to quantify the degree of uncertainty for an identification
in an unmethodical way across multiple collection types and personal
assertion styles (and essentially reintroducing subjectivity, e.g., 80 vs
85% cetain). A flagged record could still be elaboarated upon with a
textual description (or alas, even a numeric rating) in the ID_remarks if
desired and is easily searched. It just seems like if we create some sort
of Confidence-o-Meter and then find a better structure down the line, we
then have to migrate all of these [overly] precise values into the new
model that hold varying degrees of meaningfulness and which will likely be
interpreted as [overly] significant for future users. For me, just knowing
an ID is uncertain is valuable - regardless of whether the determiner was
80% or 45% confident, I know that specimens needs further examination and
should likely be excluded from niche models.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBANU5FMT4CKGUU4HHDQAXR6LA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2QQJ6A#issuecomment-513869048,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBEDEIFFQCQGQQKPZMDQAXR6LANCNFSM4IEIGLOA
.

toggle, e.g., "ID tentative"

I'm certainly fine with that, but it didn't look like we were headed for vocabulary resolution above. Something symbolic (slider, stars, whatever) doesn't require us to define terms - that's essentially all of my "recommendation." Derek seems to the the one with a solid use case; I'm happy enough to do whatever he tells me to do, as long as nobody else is screaming too loudly....

ID_remarks

No free-text field anywhere is usefully searchable, at least not as long as we let humans provide the values. Wee maek typograaphical eerors n etc....

It is hard enough for curatorial staff to just find and link publications.

I don't doubt that, but it provides a level of precision that nothing else can. It's certainly never going to be anything like a requirement, but if it's there it becomes a powerful tool that users can access for all sorts of reasons; it makes your data more able to answer more questions.

entire methods section of each pub to figure out exactly what technique they used and translate

That would be (can be now) up to the user, not "us." Any translation would be a downgrade. Maybe users want specimens IDed genetically using some primer that they don't trust - they can get that from id_sensu, it's not something we'd realistically be able to to predict, summarize, and/or categorize.

"genetic" or "molecular"

Those two values in the same list would probably lead to ambiguous usage.

I've been considering the slider idea and have fears there will be
unforeseen undesirable consequences. Mariel brought one up I hadn't
considered - how to bulkload the data? Also, how do we search on the field
if we don't know what's in it? or would a value 0-100 appear as the slider
is set? We could then search on those values?

I still think this brings in massive over-precision. I might set a value of
85 one day and 87 another and 90 or 86 another day all for the same basic
ID job.. it's asking users to split hairs and make impossible decisions.
Even if we just had the scale run from 1-10 (an improvement) it would still
be hard to decide.

I really like a binary choice - is this a reliable ID or not? (or null for
unknown)

-Derek

On Mon, Jul 22, 2019 at 9:30 AM dustymc notifications@github.com wrote:

toggle, e.g., "ID tentative"

I'm certainly fine with that, but it didn't look like we were headed for
vocabulary resolution above. Something symbolic (slider, stars, whatever)
doesn't require us to define terms - that's essentially all of my
"recommendation." Derek seems to the the one with a solid use case; I'm
happy enough to do whatever he tells me to do, as long as nobody else is
screaming too loudly....

ID_remarks

No free-text field anywhere is usefully searchable, at least not as long
as we let humans provide the values. Wee maek typograaphical eerors n
etc....

It is hard enough for curatorial staff to just find and link publications.

I don't doubt that, but it provides a level of precision that nothing else
can. It's certainly never going to be anything like a requirement, but if
it's there it becomes a powerful tool that users can access for all sorts
of reasons; it makes your data more able to answer more questions.

entire methods section of each pub to figure out exactly what technique
they used and translate

That would be (can be now) up to the user, not "us." Any translation would
be a downgrade. Maybe users want specimens IDed genetically using some
primer that they don't trust - they can get that from id_sensu, it's not
something we'd realistically be able to to predict, summarize, and/or
categorize.

"genetic" or "molecular"

Those two values in the same list would probably lead to ambiguous usage.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM2QF3YEWAD7623UHRDQAXVCRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2QSS7Y#issuecomment-513878399,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM27SVTUHKUZGNTFVW3QAXVCRANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

how to bulkload the data?

The slider would just be an abstraction of some range of values - 0-1 or 0-10000000000 or whatever - I think it's become "percent confidence" in my head but that's probably overly precise/wrong.

Arctos is unicode, so something like a code table of

IN (NULL,⭐,⭐⭐,⭐⭐⭐,⭐⭐⭐⭐,⭐⭐⭐⭐⭐)

is completely possible as well. Not suggesting we should, but we could - we are not limited to text/ASCII.

I might set a value of 85 one day and 87 another and 90 or 86 another day all for the same basic
ID job.

Yep, probably - I guess in addition to the terminology thing, you (or someone) mentioned some "expert" threshold above that sounded really low for some use cases. The slider would let you-the-user and you-the-operator define "expert" (or "unreliable" or whatever) however you want, assuming there's some sort of grouping around sliding the widget ALMOST all the way to the right for "I'm pretty confident in this" - and that may be an unjustified assumption.

binary

I'm certainly fine with that if we can find acceptable vocabulary. It's a lot easier to code too!

null for unknown

It's overly pedantic for this conversation, but those are not the same thing. NULL ==> we say nothing (including about why we're saying nothing). "Unknown" ==> we don't have the information. (https://arctos.database.museum/info/ctDocumentation.cfm?table=CTSEX_CDE contains a couple more ways we can get at not having anything to say.)

null for unknown

It's overly pedantic for this conversation, but those are not the same thing. NULL ==> we say nothing (including about why we're saying nothing). "Unknown" ==> we don't have the information. (https://arctos.database.museum/info/ctDocumentation.cfm?table=CTSEX_CDE contains a couple more ways we can get at not having anything to say.)

I think it is an important distinction...

agree.

On Mon, Jul 22, 2019 at 1:00 PM Teresa Mayfield-Meyer <
[email protected]> wrote:

null for unknown

It's overly pedantic for this conversation, but those are not the same
thing. NULL ==> we say nothing (including about why we're saying nothing).
"Unknown" ==> we don't have the information. (
https://arctos.database.museum/info/ctDocumentation.cfm?table=CTSEX_CDE
contains a couple more ways we can get at not having anything to say.)

I think it is an important distinction...

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBFXWPDPN27CMO3RO3LQAX7VRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q2XXQ#issuecomment-513911774,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBFQNU7LYWKKDFP4HNLQAX7VRANCNFSM4IEIGLOA
.

Agreed

4 options for ID confidence, how about:

reliable
uncertain reliability
unknown
NULL

I really think we should avoid making clear statements that the reliability
is poor or unreliable etc. because that seems too strong & we don't want to
be making former nature of IDs with 'student' = 'unreliable'... a much
better term is ' uncertain reliability'

-D

On Mon, Jul 22, 2019 at 11:00 AM Teresa Mayfield-Meyer <
[email protected]> wrote:

null for unknown

It's overly pedantic for this conversation, but those are not the same
thing. NULL ==> we say nothing (including about why we're saying nothing).
"Unknown" ==> we don't have the information. (
https://arctos.database.museum/info/ctDocumentation.cfm?table=CTSEX_CDE
contains a couple more ways we can get at not having anything to say.)

I think it is an important distinction...

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYMXWGCRZECULPA5BLQAX7VRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q2XXQ#issuecomment-513911774,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM3IMKRO33PCXERUPS3QAX7VRANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I'm OK with that, but there have also been times when I might have used an "unreliable" option. Something's not some Eurasian species so I know more than genus (the next-highest taxonomic pigeonhole), but I'm still not entirely confident that the species I've picked is correct. I suppose "A {string}" gets there too, but it isn't as searchable as I'd like.

There would need to be some clear separation between "uncertain reliability" and "unknown" (and NULL) - I'm not sure I see that just from the vocabulary.

Perhaps we need to treat these as flags going forward, but legacy data will
have to be left as null. I would not want to convert all our "student" IDs
to uncertain reliability if that meant those records would be excluded from
niche modeling, for example. Most everything we have is "student".

On Mon, Jul 22, 2019 at 1:15 PM dustymc notifications@github.com wrote:

I'm OK with that, but there have also been times when I might have used an
"unreliable" option. Something's not some Eurasian species so I know more
than genus (the next-highest taxonomic pigeonhole), but I'm still not
entirely confident that the species I've picked is correct. I suppose "A
{string}" gets there too, but it isn't as searchable as I'd like.

There would need to be some clear separation between "uncertain
reliability" and "unknown" (and NULL) - I'm not sure I see that just from
the vocabulary.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCCNNGPU476TALJ3ATQAYBM7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q376Q#issuecomment-513916922,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBECR2YEQZ25HO5WDPDQAYBM7ANCNFSM4IEIGLOA
.

I keep forgetting to mention this but in that case, when you want to
indicate unreliable, why not just use ID formula A ?

-D

On Mon, Jul 22, 2019 at 11:15 AM dustymc notifications@github.com wrote:

I'm OK with that, but there have also been times when I might have used an
"unreliable" option. Something's not some Eurasian species so I know more
than genus (the next-highest taxonomic pigeonhole), but I'm still not
entirely confident that the species I've picked is correct. I suppose "A
{string}" gets there too, but it isn't as searchable as I'd like.

There would need to be some clear separation between "uncertain
reliability" and "unknown" (and NULL) - I'm not sure I see that just from
the vocabulary.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM4KKCLNNS3FR26V54DQAYBM7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q376Q#issuecomment-513916922,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMY5PUQQV3PHHBKJPITQAYBM7ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Mariel,

If we did go this route then it sounds like you'd like your 'student' IDs
changed to 'reliable' ? I suppose Dusty could do this on a case-by case
basis. But then how would you distinguish 'world expert/taxonomic
authority' based IDs from the rest in your collection?

-Derek

On Mon, Jul 22, 2019 at 11:17 AM Mariel Campbell notifications@github.com
wrote:

Perhaps we need to treat these as flags going forward, but legacy data will
have to be left as null. I would not want to convert all our "student" IDs
to uncertain reliability if that meant those records would be excluded from
niche modeling, for example. Most everything we have is "student".

On Mon, Jul 22, 2019 at 1:15 PM dustymc notifications@github.com wrote:

I'm OK with that, but there have also been times when I might have used
an
"unreliable" option. Something's not some Eurasian species so I know more
than genus (the next-highest taxonomic pigeonhole), but I'm still not
entirely confident that the species I've picked is correct. I suppose "A
{string}" gets there too, but it isn't as searchable as I'd like.

There would need to be some clear separation between "uncertain
reliability" and "unknown" (and NULL) - I'm not sure I see that just from
the vocabulary.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCCNNGPU476TALJ3ATQAYBM7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q376Q#issuecomment-513916922
,
or mute the thread
<
https://github.com/notifications/unsubscribe-auth/ADQ7JBECR2YEQZ25HO5WDPDQAYBM7ANCNFSM4IEIGLOA

.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM42D73A2AXLNYQEOM3QAYBVXA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q4FSQ#issuecomment-513917642,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5J7BXQUFLFUO4KSXTQAYBVXANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

We have expert for some of our IDs made by experts in that group who have
examined the specimens through some criteria. But most of our IDs are made
by collection managers who use "student" because they are not experts in a
particular group, even though they are qualified taxonomic professionals.
This is what we've been instructed to consider "student" to mean.
Even our "field" IDs are usually made by people who are consulting keys and
have some knowledge of the taxa involved - but we are missing key
characters (e.g. cleaned dentition) to make a full diagnosis; and the
confidence depends largely on the species richness and geographic
distribution of the taxonomic group at a particular locality . . .

On Mon, Jul 22, 2019 at 1:21 PM DerekSikes notifications@github.com wrote:

Mariel,

If we did go this route then it sounds like you'd like your 'student' IDs
changed to 'reliable' ? I suppose Dusty could do this on a case-by case
basis. But then how would you distinguish 'world expert/taxonomic
authority' based IDs from the rest in your collection?

-Derek

On Mon, Jul 22, 2019 at 11:17 AM Mariel Campbell >
wrote:

Perhaps we need to treat these as flags going forward, but legacy data
will
have to be left as null. I would not want to convert all our "student"
IDs
to uncertain reliability if that meant those records would be excluded
from
niche modeling, for example. Most everything we have is "student".

On Mon, Jul 22, 2019 at 1:15 PM dustymc notifications@github.com
wrote:

I'm OK with that, but there have also been times when I might have used
an
"unreliable" option. Something's not some Eurasian species so I know
more
than genus (the next-highest taxonomic pigeonhole), but I'm still not
entirely confident that the species I've picked is correct. I suppose
"A
{string}" gets there too, but it isn't as searchable as I'd like.

There would need to be some clear separation between "uncertain
reliability" and "unknown" (and NULL) - I'm not sure I see that just
from
the vocabulary.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<

https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCCNNGPU476TALJ3ATQAYBM7A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q376Q#issuecomment-513916922

,
or mute the thread
<

https://github.com/notifications/unsubscribe-auth/ADQ7JBECR2YEQZ25HO5WDPDQAYBM7ANCNFSM4IEIGLOA
>

.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM42D73A2AXLNYQEOM3QAYBVXA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q4FSQ#issuecomment-513917642
,
or mute the thread
<
https://github.com/notifications/unsubscribe-auth/ACFNUM5J7BXQUFLFUO4KSXTQAYBVXANCNFSM4IEIGLOA

.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBHGJWEXOG6E643NKQTQAYCBVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2Q4NKI#issuecomment-513918633,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBEXWS5RDROOIWKO5P3QAYCBVANCNFSM4IEIGLOA
.

Was just typing what Derek said re: formula A ?. A slider also wouldn't convey that you are certain about genus designation, still wavering between a couple species but can confidently rule out all Eurasian taxa in a given group. The pick list: reliable/certain, unreliable/tentative, unknown, null (or whatever values we choose) would be an intuitve way to flag records without having to think too hard as well as a means to conviently call up records with uncertain IDs in a search. ID_remarks and/or formulas would then further detail the thinking behind tentative determinations.

If we decide to map all student IDs to NULL, perhaps we can integrate the confidence flag into the Manage menu. That way we could do global updates for taxa (yes, all skulls identified as Alces alces are reliably moose) or by agent (yes, this collection manager or grad student is good for bird IDs) or whatever parameter. I will very likely leave current student values NULL and make opportunistic changes when I happen upon them, but would use the flag moving forward for new records.

legacy data will have to be left as null

I can deal with that by collection (or any other criteria ya'll want to put together). I don't think anything global will or should happen. Pushing some confidence value for CollectionA 'expert' IDs and doing something different in CollectionB and ..... isn't much of a problem, and I think hinges on how the existing values have been used up to this point. I'm pretty sure we can do this without losing anything or forcing anyone in any direction they don't want to go in.

meant those records would be excluded from niche modeling,

We obviously can't control how people use data, but I can't really see why anyone would want to do that.

The flag doesn't/can't say anything about the ID, it's just how confident a particular user is in a particular ID. My "oh yea, I think I totally nailed it" is (in light of my "expertise" via my agent record and perhaps my cited publication) almost certainly of less value than Derek's "I'm not very sure about this" on the same beetle, for instance. This field does not and cannot live in isolation, and it should not be seen that way.

why not just use ID formula A

... {string} presumably. Searchability - if you CAN mis-type strings, you WILL mis-type strings, so it's not as discoverable in certain search fields. You can still search by the taxonomy component - I suppose you could mix all that up to say "I'm leaning towards Sorex tundrensis" (ID to species, low confidence) or "it's definitely one of the northern tricolored shrews (ID to genus, a bit more info via the text-based ID, high confidence).

Rather than 'reliable' and 'uncertain reliability' which sounds really awkward to me and am not sure what it really means or how to use it, what about this:

ID certain
ID uncertain
ID certainty unknown
null

Hate to belabor this but I keep worrying about us stating falsehoods.

ID certain is not the same as ID reliable.

The former means 100% - error is impossible, the latter means it's
trustworthy but of course, humans being humans...

so if we don't use 'reliable' I hope we can find an actual synonym

-Derek

On Mon, Jul 22, 2019 at 12:20 PM Carla Cicero notifications@github.com
wrote:

Rather than 'reliable' and 'uncertain reliability' which sounds really
awkward to me and am not sure what it really means or how to use it, what
about this:

ID certain
ID uncertain
ID certainty unknown
null

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM26IWDAQEO3EU67SD3QAYJAZA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RBKSY#issuecomment-513938763,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5I6UD7NQQVV3H7D7LQAYJAZANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I think we moved away from certain because it implies immutability?

On Mon, Jul 22, 2019, 2:20 PM Carla Cicero notifications@github.com wrote:

Rather than 'reliable' and 'uncertain reliability' which sounds really
awkward to me and am not sure what it really means or how to use it, what
about this:

ID certain
ID uncertain
ID certainty unknown
null

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBH5EVSVJD74NQJECTTQAYJAZA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RBKSY#issuecomment-513938763,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBD2FTEE6JM4CC2XDGLQAYJAZANCNFSM4IEIGLOA
.

As above, "ID _bla_" should be taken to mean "the person(s) making this identification say they're _bla_ of it" and not, ever, under any circumstances, "this identification is actually _bla_." I think we're trying to put meaning that can't exist into this.

If the person making the IDs is also the person entering the ID data into
Arctos we can say "I'm certain/uncertain" of it but in many cases we're
entering IDs made by others... and we still want some way to say how
reliable the IDs are and 'certain' shouldn't be an option.

-D

On Mon, Jul 22, 2019 at 1:08 PM dustymc notifications@github.com wrote:

As above, "ID bla" should be taken to mean "the person(s) making this
identification say they're bla of it" and not, ever, under any
circumstances, "this identification is actually bla." I think we're
trying to put meaning that can't exist into this.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM2TDYGQNIONB6NLAUTQAYOTTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RFMDQ#issuecomment-513955342,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5QNNQL7OFVX4PO5TLQAYOTTANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

How about:
field: "ID confidence"
values: high, medium, low, unknown, null

On Mon, Jul 22, 2019 at 3:08 PM dustymc notifications@github.com wrote:

As above, "ID bla" should be taken to mean "the person(s) making this
identification say they're bla of it" and not, ever, under any
circumstances, "this identification is actually bla." I think we're
trying to put meaning that can't exist into this.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCRIL5ED4MWMPT5563QAYOTTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RFMDQ#issuecomment-513955342,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBB6R6Q4BH65EG3KJPTQAYOTTANCNFSM4IEIGLOA
.

I'm ok with these... but to which would 'student' map? medium? or low? (or
let each collection decide which)

Rarely an expert will return specimens with a ? next to the ID. How would
we record this? ID confidence = high but use ID formula A ?

-D

On Mon, Jul 22, 2019 at 1:19 PM Mariel Campbell notifications@github.com
wrote:

How about:
field: "ID confidence"
values: high, medium, low, unknown, null

On Mon, Jul 22, 2019 at 3:08 PM dustymc notifications@github.com wrote:

As above, "ID bla" should be taken to mean "the person(s) making this
identification say they're bla of it" and not, ever, under any
circumstances, "this identification is actually bla." I think we're
trying to put meaning that can't exist into this.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCRIL5ED4MWMPT5563QAYOTTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RFMDQ#issuecomment-513955342
,
or mute the thread
<
https://github.com/notifications/unsubscribe-auth/ADQ7JBB6R6Q4BH65EG3KJPTQAYOTTANCNFSM4IEIGLOA

.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYGUDJTXJK7LCHDBATQAYP5FA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RGIYY#issuecomment-513959011,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMZTGHGV4ZLYULTXRITQAYP5FANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

or let each collection decide which

Yes - and if a particular collection also wants THAT student to "medium" and THIS one to "low" and some other to NULL I can accommodate that too.

? next to the ID

Ideally you'd ask them (eg, by way of sending them some form that you can upload to Arctos), but if you're assigning things you can obviously do whatever you think they meant or just leave confidence NULL. "I'm sure it might be that" certainly fits....

Ok, are we good with Mariel's suggestion?

field: "ID confidence"
values: high, medium, low, unknown, null

-D

On Mon, Jul 22, 2019 at 1:58 PM dustymc notifications@github.com wrote:

or let each collection decide which

Yes - and if a particular collection also wants THAT student to "medium"
and THIS one to "low" and some other to NULL I can accommodate that too.

? next to the ID

Ideally you'd ask them (eg, by way of sending them some form that you can
upload to Arctos), but if you're assigning things you can obviously do
whatever you think they meant or just leave confidence NULL. "I'm sure it
might be that" certainly fits....

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYHN5YBRI6XFPNVHWTQAYUPFA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2RJFZQ#issuecomment-513970918,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM75WKVDL4CWEVVHL2LQAYUPFANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I'm fine with that.

Are we good to go with this? If so I can

  • add "identification_confidence" using...

    • high==>Determiner(s), or collection personnel on behalf of determiners(s), have high confidence that the identification is correct.

    • medium==>Determiner(s), or collection personnel on behalf of determiners(s), have moderate confidence that the identification is correct.

    • low==>Determiner(s), or collection personnel on behalf of determiners(s), have low confidence that the identification is correct.

    • unknown==>Determiner(s), or collection personnel on behalf of determiners(s), have expressed that they do not know how confident the identification is.

    • (NULL==>nobody asserts anything)

  • add the values from the spreadsheet to the code table
  • set up a temporary trigger to disallow using 'old' values
  • work with ya'll to migrate existing data to the new structure
  • clean up old values

From https://github.com/ArctosDB/arctos/issues/1868: When implementing add drag-drop ordering, replace the JS checks with HTML5 controls

Could we change "Identifier(s)" to "Determiners"

The latter is a synonym of the former but the former is a homonym of those
numbers and codes we assign to specimens... and homonyms can lead to
confusions.

Otherwise, I'm happy with this.

-Derek

On Wed, Jul 24, 2019 at 12:19 PM dustymc notifications@github.com wrote:

Are we good to go with this? If so I can

  • add "identification_confidence" using...

    • high==>Identifier(s), or collection personnel on behalf of

      identifier(s), have high confidence that the identification is correct.

    • medium==>Identifier(s), or collection personnel on behalf of

      identifier(s), have moderate confidence that the identification is correct.

    • low==>Identifier(s), or collection personnel on behalf of

      identifier(s), have low confidence that the identification is correct.

    • unknown==>Identifier(s), or collection personnel on behalf of

      identifier(s), have expressed that they do not know how confident the

      identification is.

    • (NULL==>nobody asserts anything)

  • add the values from the spreadsheet
    https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899
    to the code table
  • set up a temporary trigger to disallow using 'old' values
  • work with ya'll to migrate existing data to the new structure
  • clean up old values

From #1868 https://github.com/ArctosDB/arctos/issues/1868: When
implementing add drag-drop ordering, replace the JS checks with HTML5
controls

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM6OCQCGDSRHTQF24ATQBC2NRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2XPZZI#issuecomment-514784485,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5YU5J7NVW5PX4JL7TQBC2NRANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Determiners

Edited above.

What about saying that high confidence means high confidence that the
determiner asserts that the identification is correct according to
currently accepted taxonomy?

On Wed, Jul 24, 2019 at 3:22 PM dustymc notifications@github.com wrote:

Determiners

Edited above.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBA3XJBOOJZ4GBD5SS3QBDBYNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2XU3FI#issuecomment-514805141,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBHCABPFFN3B2MZCSHLQBDBYNANCNFSM4IEIGLOA
.

according to...taxonomy

That's what ID_Sensu does, and confounding it with something else could only result in conflicting and low-quality data.

I'm ok with the confidence score.

Are we settled on the vocabulary? I'm still strugging with 'fine features' and 'coarse features' - I like the contrast, but it just sounds funny to me when applied to birds and isn't really clear without reading the definition. Can we think of something better that works for cultural as well as natural history specimens? Can we use 'fine traits' and 'coarse traits' instead? We use trait-based data all the time, and I think (?) that is clearer for natural history phenotypic information but does it work for cultural collections @AJLinn ?

We need to work on the migration path.

fine features' and 'coarse features'

It's not COMPLETELY painless, but changing vocabulary, as long as we're not altering the concept/meaning, isn't too difficult. I'm not thrilled with the current vocab either, but I think it sufficiently says what we're trying to get across.

I think "trait" may in some communities be tainted with the implication of precise measurements, but otherwise I like it....

I've made some minor updates to the migration tab on the spreadsheet. Again this will be a "it's been X time and nobody's contacted me" fallback; I will work with anyone who wants to to bring their data across in non-default ways, and the path I've laid out above doesn't require us to do that before moving to the new vocabulary/structure. I can also retain the old values in remarks, so we can revisit anything that didn't get migrated as it should have.

FWIW in looking for "ID of kin" I scrolled by "relationship" about 8 times - perhaps that term could still use some help (or maybe it's just me...).

For ID of kin, there may not be an explicitly stated relationship to another individual. The relationship may be to others in a captive breeding population, and we only have some of those individuals cataloged in Arctos, and we don't have all the familiar relationships tracked.

I can't readily enforce such things (and don't really want to), but without a relationship the 'relationship' identification is less than useless. We are asserting that this is the same as something else, but you can't have any of that because we're not going to tell you what it is. I very strongly recommend we add the caution regarding relationships back to the documentation.

I don't think any of the reasons given are barriers - nests and eggs can be cataloged and referenced, sightings can be cataloged as observations, reference material in other collections can be linked (or at least given IDs, which will presumably become links when the other collection gets better technology), etc.

In the case @campmlc mentioned, I'd probably use 'unknown' and elaborate in remarks: "This specimen was not examined, but it is the sibling of ABCXYZ which was examined using......"

Can we use 'fine traits' and 'coarse traits' instead? We use trait-based data all the time, and I think (?) that is clearer for natural history phenotypic information but does it work for cultural collections @AJLinn ?

"Traits" instead of "features" should be fine for us. Any of these new changes will require some re-training, but that's just fine with me if it gives us more clear explanations! Just curious... if we go thru and make changes/updates to the nature of ID based on these new categories, does Arctos retain a history of those changes so we can see "it used to be IDed as a toggle but someone with a high confidence changed it to be a belt fastener on this date"? I assume so, but I don't know how to get at that info.

@AJLinn ID history is preserved, but the old terminology will only be preserved in remarks.

http://arctos.database.museum/guid/MSB:Mamm:55245 has a bunch of IDs, all in agreement with each other.

Screen Shot 2019-07-25 at 9 55 25 AM

I can certainly 'unaccept' the current ID and add one as part of the migration process if that's somehow useful to you.

@AJLinn ID history is preserved, but the old terminology will only be preserved in remarks.

http://arctos.database.museum/guid/MSB:Mamm:55245 has a bunch of IDs, all in agreement with each other.

In that example, those additional IDs were inserted by "Add Determination", right? Is there a best practice for making changes, by "Edit an Existing Determination" vs. "Add Determination"? Will the most current & highest confidence be privileged over the later, lower confidence ID if they're just added vs. editing? If we use edit to change, we would manually enter the change history in remarks?

inserted by "Add Determination", right?

Yes, same pathway/structure anyway - they were likely created with the citation bulkloader.

"Edit an Existing Determination" vs. "Add Determination"

If you're not changing anything fundamental, edit is probably OK. If you're changing something fundamental, or adding new opinions (including those that agree with current opinions), add is better. I don't think adding can ever be "wrong."

You can choose "accept" (or not) when creating an ID. A new accepted will push everything else to unaccepted; nothing manual is required.

You can explicitly choose which ID is accepted - eg, keeping an older or lower-confidence ID as accepted is fine, that's 100% a curatorial call.

I'm going to next task and implement per https://github.com/ArctosDB/arctos/issues/2170#issuecomment-514784485 unless someone stops me soonish, lest this falls through the cracks again.

The table structure is in place and the single-record form can access identification_confidence. I'll get the other Arctos forms next.

After that I'll add https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899 to the code table, and then I intend to disallow usage of existing terms - I can't just get rid of them until we've migrated everything. That's probably going to be a bit disruptive for things currently in the bulkloader - please let me know if I can help update anything.

Can we NOT implement identification_confidence in the bulkloader for the time being? It should be there, but if there is not an immediate need I would like to delay that until we get to a new system so I don't have to rebuild a huge amount of complex code in two languages. If there is an immediate need I'll just deal with it.

"legacy terms" are now blocked from new ~usage~ insert; they can still be used for update (necessary to allow adding identifications, which implicitly updates all existing IDs). I'm increasing priority of this; the longer we put off finalizing migration, the more chances that will have to cause problems.

/remind me, check back in two weeks

I am taking migration requests. I'll do something hopefully-not-too-random (and leave verbose remarks) with the legacy data if I don't receive specific instructions in the next two weeks.

Here's a very tentative plan. Please let me know if I can make any of these updates globally (for all identifications in all collections) now; some seem straightforward, but perhaps I'm missing something.

ID of kin --> relationship
ID to species group --> unknown
curatorial --> unknown
erroneous citation --> unknown
expert --> unknown
field --> unknown
legacy --> unknown - CAN I DO THIS NOW?
molecular data--> molecular - CAN I DO THIS NOW?
photograph --> unknown
published referral --> unknown
~revised taxonomy --> taxonomic revision - CAN I DO THIS NOW?~
student --> unknown
type specimen --> unknown

Dusty,

I need all the IDs in my collections with nature of id 'expert' ->
confidence = 'high'

& why are you planning on changing 'revised taxonomy' back to 'taxonomic
revision'????? We changed 'taxonomic revision' to 'revised taxonomy' for
very good reasons!! They should stay 'revised taxonomy'!!

Re: molecular data-> molecular.... sure, why not

re: legacy-> unknown .... I suppose so.

I also want all my nature of id 'student' to have confidence = medium

& nature of ID 'type specimen' should not become 'unknown' !! those are the
HIGHEST confidence IDs we can possibly have, since the name they are given
is correct, by definition.

-Derek

On Wed, Oct 2, 2019 at 10:55 AM dustymc notifications@github.com wrote:

"legacy terms" are now blocked from new usage insert; they can still be
used for update (necessary to allow adding identifications, which
implicitly updates all existing IDs). I'm increasing priority of this; the
longer we put off finalizing migration, the more chances that will have to
cause problems.

!remind me, check back in two weeks

I am taking migration requests. I'll do something hopefully-not-too-random
(and leave verbose remarks) with the legacy data if I don't receive
specific instructions in the next two weeks.

Here's a very tentative plan. Please let me know if I can make any of
these updates globally (for all identifications in all collections) now;
some seem straightforward, but perhaps I'm missing something.

ID of kin --> relationship
ID to species group --> unknown
curatorial --> unknown
erroneous citation --> unknown
expert --> unknown
field --> unknown
legacy --> unknown - CAN I DO THIS NOW?
molecular data--> molecular - CAN I DO THIS NOW?
photograph --> unknown
published referral --> unknown
revised taxonomy --> taxonomic revision - CAN I DO THIS NOW?
student --> unknown
type specimen --> unknown

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM6FFRK4QL6YEVDHCFLQMTVDTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAFZYYA#issuecomment-537631840,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM7NCXIJKLQ5XLDXMXTQMTVDTANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

nature of id 'expert' ->confidence = 'high'

I need a new value for nature - that's not a change, it's an addition.

nature of id 'student' to have confidence = medium

ditto

hanging 'revised taxonomy' back to 'taxonomic revision'?????

That's what's in the spreadsheet: https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899

'type specimen' should not become 'unknown' !!

What should they become then? _I_ certainly don't know how they were made!

HIGHEST confidence

That's a separate issue. "We don't know where this came from (nature_of_id=unknown), but it can't possibly be wrong (confidence=high)" fits in the new model.

D,

I just don't want to lose nature of ID = expert BEFORE we set confidence
for those IDs to High, otherwise, how would we do so??

& ditto for nature of ID = student

Re: revised taxonomy.... what argument was made to change this back to
taxonomic revision?? I'm sure there's a record of all the discussion on
this but in a nutshell, 'revised taxonomy' is better because the term
'revision' has a specific meaning in taxonomy and not all taxonomic changes
come from taxonomic revisions. Thus it is confusing and misleading to use
'taxonomic revision'. Change the table.

re: type specimens - if we don't want to keep this in nature of ID then it
should be in the confidence field. It would represent a step above 'high',
at least for primary types... My concern is there must be specimens with
nature of ID = type specimen that are not linked to publications via a
citation as 'types' & thus we'll loose this information. Perhaps it should
be added to ID remarks to ensure it's not lost?

-Derek

On Wed, Oct 2, 2019 at 11:07 AM dustymc notifications@github.com wrote:

nature of id 'expert' ->confidence = 'high'

I need a new value for nature - that's not a change, it's an addition.

nature of id 'student' to have confidence = medium

ditto

hanging 'revised taxonomy' back to 'taxonomic revision'?????

That's what's in the spreadsheet:
https://docs.google.com/spreadsheets/d/1JfbcpVYTK73DKRkgJQ0jNrFziS9zcbx7lEq872WrfYs/edit#gid=115578899

'type specimen' should not become 'unknown' !!

What should they become then? I certainly don't know how they were made!

HIGHEST confidence

That's a separate issue. "We don't know where this came from
(nature_of_id=unknown), but it can't possibly be wrong (confidence=high)"
fits in the new model.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM3K57B7FJWHKWTEPCLQMTWQVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAF24EA#issuecomment-537636368,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM622CESTW6XL4WHORLQMTWQVANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

D - I'd prefer to set them all at once so they don't have an opportunity to get mixed up with any 'legacy' values.

UPDATE identification SET nature=newthing, confidence=whatever WHERE {anything I can get from existing data - collection, agent, nature, dates, publications, whatever}

is my strong preference.

revised taxonomy.... what argument

I have none - I'm just copying from the spreadsheet!

My concern is there must be specimens with
nature of ID = type specimen that are not linked to publications via a
citation as 'types'

Very possible, and I'd interpret that the other way - there's no publication, this therefore can't be a type, someone used the wrong nature.

added to ID remarks

Perhaps instead of by-request I should just add the original to remarks for everything that gets updated from this. It's no problem to do so.

/remind me to finalize this in 2 weeks

@dustymc set a reminder for Oct 16th 2019

Perhaps instead of by-request I should just add the original to remarks

for everything that gets updated from this. It's no problem to do so.

yes - sounds good

 there's no publication, this therefore can't be a type, someone used

the wrong nature.

Illogical. Nothing prevents people from failing to enter publications for
real types.

& re: revised taxonomy. Just fix the spreadsheet

-D

On Wed, Oct 2, 2019 at 11:25 AM dustymc notifications@github.com wrote:

D - I'd prefer to set them all at once so they don't have an opportunity
to get mixed up with any 'legacy' values.

UPDATE identification SET nature=newthing, confidence=whatever WHERE {anything I can get from existing data - collection, agent, nature, dates, publications, whatever}

is my strong preference.

revised taxonomy.... what argument

I have none - I'm just copying from the spreadsheet!

My concern is there must be specimens with
nature of ID = type specimen that are not linked to publications via a
citation as 'types'

Very possible, and I'd interpret that the other way - there's no
publication, this therefore can't be a type, someone used the wrong nature.

added to ID remarks

Perhaps instead of by-request I should just add the original to remarks
for everything that gets updated from this. It's no problem to do so.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM6C36B7BQ5MOLJ2X53QMTYRZA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAF4LZI#issuecomment-537642469,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM2ZRDBVJ2W66F5ZFLLQMTYRZANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Unless someone explicitly directs me otherwise, all updates will include...

AND id_remarks+=Original Nature_of_ID: {oldvalue}

fix the spreadsheet

I don't think that's my call - those values (which have already been implemented) got there by consensus (maybe...). I don't know if someone feels that the current term is better for some reason, or it's just arbitrary. I think @jldunnum had something to say at some point?? @ccicero @campmlc you edited the spreadsheet??

I have no opinion - I think they're saying the same THING, I think it's a straight update (won't require changing confidence), should be easy to update when I figure out how we got here.

Just to reiterate what I wrote previously:

Re: revised taxonomy.... what argument was made to change this back to
taxonomic revision?? I'm sure there's a record of all the discussion on
this but in a nutshell, 'revised taxonomy' is better because the term
'revision' has a specific meaning in taxonomy and not all taxonomic changes
come from taxonomic revisions. Thus it is confusing and misleading to use
'taxonomic revision'. Change the table.

& the history was that this value had been 'taxonomic revision' since
forever... and I argued, and others agreed, to change it to 'revised
taxonomy' so it's baffling to me that people want to change it back & I'd
like to hear what reasoning was behind that?

-Derek

On Wed, Oct 2, 2019 at 11:36 AM dustymc notifications@github.com wrote:

Unless someone explicitly directs me otherwise, all updates will include...

AND id_remarks+=Original Nature_of_ID: {oldvalue}

fix the spreadsheet

I don't think that's my call - those values (which have already been
implemented) got there by consensus (maybe...). I don't know if someone
feels that the current term is better for some reason, or it's just
arbitrary. I think @jldunnum https://github.com/jldunnum had something
to say at some point?? @ccicero https://github.com/ccicero @campmlc
https://github.com/campmlc you edited the spreadsheet??

I have no opinion - I think they're saying the same THING, I think it's a
straight update (won't require changing confidence), should be easy to
update when I figure out how we got here.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM5KVMX6ONOYQRDNTNTQMTZ4HA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAF5LDI#issuecomment-537646477,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMZQBDQBANCDPCHTDT3QMTZ4HANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I finally found that thread

https://github.com/ArctosDB/arctos/issues/1093

Looks like we were all agreed on "revised taxonomy" at some point - not a clue what happened in the meantime.

Unless someone stops me in the very near future I will

  • change all taxonomic revision to revised taxonomy
  • allow revised taxonomy for new IDs
  • remove taxonomic revision from the code table

Bueno?

Is this why I'm getting this error right now when reiding as Revised taxonomy?
Legacy nature_of_id terms are disallowed; see https://github.com/ArctosDB/arctos/issues/2170
[Macromedia][Oracle JDBC Driver][Oracle]ORA-20001: Legacy nature_of_id terms are disallowed; see https://github.com/ArctosDB/arctos/issues/2170 ORA-06512: at "UAM.TEMP_TR_ID_BIU", line 4 ORA-04088: error during execution of trigger 'UAM.TEMP_TR_ID_BIU'
Error Executing Database Query.
Check the Arctos Handbookhttp://handbook.arctosdb.org/documentation/understanding-errors.html for more information on errors.


Jonathan L. Dunnum Ph.D.
Senior Collection Manager
Division of Mammals, Museum of Southwestern Biology
University of New Mexico
Albuquerque, NM 87131
(505) 277-9262
Fax (505) 277-1351

MSB Mammals website: http://www.msb.unm.edu/mammals/index.html
Facebook: http://www.facebook.com/MSBDivisionofMammals

Shipping Address:
Museum of Southwestern Biology
Division of Mammals
University of New Mexico
CERIA Bldg 83, Room 204
Albuquerque, NM 87131


From: dustymc notifications@github.com
Sent: Wednesday, October 2, 2019 2:45 PM
To: ArctosDB/arctos arctos@noreply.github.com
Cc: Jonathan Dunnum jldunnum@unm.edu; Mention mention@noreply.github.com
Subject: Re: [ArctosDB/arctos] nature of ID (#2170)

I finally found that thread

1093https://github.com/ArctosDB/arctos/issues/1093

Looks like we were all agreed on "revised taxonomy" at some point - not a clue what happened in the meantime.

Unless someone stops me in the very near future I will

  • change all taxonomic revision to revised taxonomy
  • allow revised taxonomy for new IDs
  • remove taxonomic revision from the code table

Bueno?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHubhttps://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AED2PA6F5VEMBO4CIDWH4N3QMUB7LA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAGD2FY#issuecomment-537672983, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AED2PA32OON6JQFZK2QEZPLQMUB7LANCNFSM4IEIGLOA.

Bueno!!

-Derek

On Wed, Oct 2, 2019 at 12:45 PM dustymc notifications@github.com wrote:

I finally found that thread

1093 https://github.com/ArctosDB/arctos/issues/1093

Looks like we were all agreed on "revised taxonomy" at some point - not a
clue what happened in the meantime.

Unless someone stops me in the very near future I will

  • change all taxonomic revision to revised taxonomy
  • allow revised taxonomy for new IDs
  • remove taxonomic revision from the code table

Bueno?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM5NZIBEDXJE5D2QXH3QMUB7JA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAGD2FY#issuecomment-537672983,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5YP7XCYVZRQTYCTFDQMUB7JANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

The rule is flipped - you can use "revised taxonomy" now. I'll update the documentation, flip the values (plus remark), and delete taxonomic revision ~now-ish.

taxonomic revision is no more

awesome

On Wed, Oct 2, 2019 at 1:31 PM dustymc notifications@github.com wrote:

taxonomic revision is no more

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYV22E5RKWIYZBEWI3QMUHMBA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAGIARQ#issuecomment-537690182,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM3F4MTO53A6S4WFAKDQMUHMBANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I am getting that same error for Expert as Nature of ID and High confidence.
Can we not change identifications right now?

On Wed, Oct 2, 2019 at 1:45 PM dustymc notifications@github.com wrote:

I finally found that thread

1093 https://github.com/ArctosDB/arctos/issues/1093

Looks like we were all agreed on "revised taxonomy" at some point - not a
clue what happened in the meantime.

Unless someone stops me in the very near future I will

  • change all taxonomic revision to revised taxonomy
  • allow revised taxonomy for new IDs
  • remove taxonomic revision from the code table

Bueno?

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ABCJF4KWYIPBQBICEFXBCL3QMUB7JA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAGD2FY#issuecomment-537672983,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ABCJF4MR5MK2NLEKZ7YT5KLQMUB7JANCNFSM4IEIGLOA
.

--
Carol L. Spencer, Ph.D.
Staff Curator of Herpetology & Researcher
Museum of Vertebrate Zoology
3101 Valley Life Sciences Building
University of California, Berkeley, CA, USA 94720-3160
[email protected] or [email protected]
510-643-5778
http://mvz.berkeley.edu/

You can change IDs, but you can't use the terms which are going away very soon. http://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID is currently the best source of information.

Let me if I can help update any of your data with a little more finesse than https://github.com/ArctosDB/arctos/issues/2170#issuecomment-537631840

Thanks Dusty!!!

On Wed, Oct 2, 2019 at 5:13 PM dustymc notifications@github.com wrote:

You can change IDs, but you can't use the terms which are going away very
soon.
http://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID
is currently the best source of information.

Let me if I can help update any of your data with a little more finesse
than #2170 (comment)
https://github.com/ArctosDB/arctos/issues/2170#issuecomment-537631840

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ABCJF4N4OEO2MXU6KDOSWTDQMU2MNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAGSI3Y#issuecomment-537732207,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ABCJF4N37YRZE2PFFCA3KVDQMU2MNANCNFSM4IEIGLOA
.

--
Carol L. Spencer, Ph.D.
Staff Curator of Herpetology & Researcher
Museum of Vertebrate Zoology
3101 Valley Life Sciences Building
University of California, Berkeley, CA, USA 94720-3160
[email protected] or [email protected]
510-643-5778
http://mvz.berkeley.edu/

here's the crosswalk:
ID of kin --> relationship
ID to species group --> unknown
curatorial --> unknown
erroneous citation --> unknown
expert --> FINE FEATURES
field --> COARSE FEATURES
legacy --> unknown - CAN I DO THIS NOW? yes
molecular data--> molecular - CAN I DO THIS NOW? yes
photograph --> COARSE FEATURES?
published referral --> unknown
revised taxonomy --> taxonomic revision - CAN I DO THIS NOW?
student --> unknown
type specimen --> unknown

ID of kin --> relationship
ID to species group --> unknown
curatorial --> FINE FEATURES
erroneous citation --> unknown
expert -->FINE FEATURES
field --> COARSE FEATURES
legacy --> unknown - CAN I DO THIS NOW? yes
molecular data--> molecular - CAN I DO THIS NOW? yes
photograph --> COARSE FEATURES
published referral --> unknown
revised taxonomy --> taxonomic revision - CAN I DO THIS NOW? yes
student --> COARSE FEATURES
type specimen --> FINE FEATURES

select guid_prefix,nature_of_id, count(*) c from collection,cataloged_item,identification where collection.collection_id=cataloged_item.collection_id and cataloged_item.collection_object_id=identification.collection_object_id group by guid_prefix,nature_of_id order by guid_prefix,nature_of_id;


GUID_PREFIX      NATURE_OF_ID                C
-------------------- ------------------------------ ----------
ALMNH:ES         expert                 19
ALMNH:ES         legacy              10351
ALMNH:ES         published referral              3
ALMNH:ES         revised taxonomy                2
ALMNH:ES         type specimen              16
APSU:Herp        legacy               7010
ASNHC:Mamm       field                   1
ASNHC:Mamm       legacy                135
BYU:Herp         field                1281
BYU:Herp         legacy                177
BYU:Herp         photograph                  8
CHAS:Bird        ID to species group             1
CHAS:Bird        expert                139
CHAS:Bird        field                   1
CHAS:Bird        legacy              21858
CHAS:Bird        revised taxonomy                1
CHAS:Bird        student                   230
CHAS:EH          curatorial                  4
CHAS:EH          expert                119
CHAS:EH          legacy                620
CHAS:EH          student                   281
CHAS:Egg         ID of kin                 679
CHAS:Egg         geographic distribution             3
CHAS:Egg         legacy               3337
CHAS:Egg         revised taxonomy                1
CHAS:Egg         student                     1
CHAS:Ento        ID of kin                   1
CHAS:Ento        curatorial                  1
CHAS:Ento        expert                 16
CHAS:Ento        legacy              32137
CHAS:Ento        revised taxonomy               15
CHAS:Ento        student                   564
CHAS:Fish        expert                  1
CHAS:Fish        legacy                 29
CHAS:Herb        expert                 52
CHAS:Herb        field                   9
CHAS:Herb        legacy                 99
CHAS:Herp        expert                  1
CHAS:Herp        legacy                  1
CHAS:Herp        student                     1
CHAS:Inv         curatorial                 34
CHAS:Inv         expert               1971
CHAS:Inv         legacy              25050
CHAS:Inv         published referral              1
CHAS:Inv         revised taxonomy              927
CHAS:Inv         student                    72
CHAS:Inv         type specimen             124
CHAS:Mamm        expert                 37
CHAS:Mamm        field                   2
CHAS:Mamm        geographic distribution             1
CHAS:Mamm        legacy               9912
CHAS:Mamm        revised taxonomy                8
CHAS:Mamm        student                    33
CHAS:Mamm        type specimen              17
CHAS:Teach       ID of kin                   1
CHAS:Teach       expert                 71
CHAS:Teach       field                   1
CHAS:Teach       legacy                954
CHAS:Teach       published referral              1
CHAS:Teach       student                   966
COA:Bird         legacy                878
COA:Egg          legacy                119
COA:Ento         legacy                 37
COA:Herp         legacy                 61
COA:Mamm         legacy                187
DGR:Bird         erroneous citation              1
DGR:Bird         expert                 22
DGR:Bird         field                 351
DGR:Bird         geographic distribution             1
DGR:Bird         legacy               2149
DGR:Bird         molecular data              1
DGR:Bird         revised taxonomy                1
DGR:Bird         type specimen               6
DGR:Ento         legacy                 35
DGR:Mamm         expert                131
DGR:Mamm         field                   2
DGR:Mamm         geographic distribution             3
DGR:Mamm         legacy                664
DGR:Mamm         molecular data             18
DGR:Mamm         revised taxonomy                3
DGR:Mamm         student                     1
DGR:Mamm         type specimen               1
DMNS:Bird        ID of kin                  23
DMNS:Bird        curatorial                 37
DMNS:Bird        erroneous citation             15
DMNS:Bird        expert               1938
DMNS:Bird        field                9106
DMNS:Bird        geographic distribution            42
DMNS:Bird        legacy              42312
DMNS:Bird        molecular data             26
DMNS:Bird        published referral              5
DMNS:Bird        revised taxonomy             3198
DMNS:Bird        student                     5
DMNS:Bird        type specimen              21
DMNS:Egg         field                 182
DMNS:Egg         legacy               6958
DMNS:Egg         published referral              1
DMNS:Egg         revised taxonomy              135
DMNS:Herp        curatorial                  2
DMNS:Herp        expert                 97
DMNS:Herp        field                  13
DMNS:Herp        geographic distribution             7
DMNS:Inv         ID of kin                  12
DMNS:Inv         erroneous citation              2
DMNS:Inv         expert                524
DMNS:Inv         geographic distribution             6
DMNS:Inv         legacy              13118
DMNS:Inv         published referral              3
DMNS:Inv         revised taxonomy             8415
DMNS:Inv         student                 10609
DMNS:Mamm        ID of kin                 185
DMNS:Mamm        ID to species group             5
DMNS:Mamm        curatorial                  4
DMNS:Mamm        expert               3764
DMNS:Mamm        field               15057
DMNS:Mamm        geographic distribution           102
DMNS:Mamm        legacy                544
DMNS:Mamm        molecular data            132
DMNS:Mamm        published referral             21
DMNS:Mamm        revised taxonomy             1108
DMNS:Mamm        student                    29
DMNS:Mamm        type specimen              48
DMNS:Para        expert                771
DMNS:Para        field                  11
DMNS:Para        molecular data              6
DMNS:Para        student                   123
HWML:Para        expert                  4
HWML:Para        legacy              24396
HWML:Para        type specimen            1711
KNWR:Ento        ID of kin                  14
KNWR:Ento        ID to species group             1
KNWR:Ento        erroneous citation              1
KNWR:Ento        expert               3432
KNWR:Ento        field                 223
KNWR:Ento        geographic distribution             3
KNWR:Ento        legacy                 59
KNWR:Ento        molecular data            565
KNWR:Ento        photograph                 59
KNWR:Ento        published referral              9
KNWR:Ento        revised taxonomy              282
KNWR:Ento        student                  8057
KNWR:Ento        type specimen              15
KNWR:Herb        ID of kin                   3
KNWR:Herb        expert               1087
KNWR:Herb        field                  93
KNWR:Herb        legacy                621
KNWR:Herb        molecular data              5
KNWR:Herb        photograph                  2
KNWR:Herb        revised taxonomy               25
KNWR:Herb        student                   721
KNWR:Inv         expert                 25
KNWR:Inv         field                   8
KNWR:Inv         molecular data              6
KNWR:Inv         revised taxonomy                4
KNWR:Inv         student                    26
KNWRObs:Fish         expert                 11
KNWRObs:Herb         field                 830
KWP:Ento         ID to species group           465
KWP:Ento         erroneous citation              2
KWP:Ento         expert              77860
KWP:Ento         geographic distribution           235
KWP:Ento         molecular data              7
KWP:Ento         photograph                259
KWP:Ento         revised taxonomy            34601
KWP:Ento         student                  9520
KWP:Ento         type specimen               1
MLZ:Bird         ID of kin                  21
MLZ:Bird         expert               7046
MLZ:Bird         geographic distribution           245
MLZ:Bird         legacy              62310
MLZ:Bird         published referral           1032
MLZ:Bird         revised taxonomy             2704
MLZ:Bird         type specimen             193
MLZ:Egg          ID of kin                   3
MLZ:Egg          expert                  7
MLZ:Egg          field                   7
MLZ:Egg          legacy                 21
MLZ:Herb         expert                 16
MLZ:Mamm         ID of kin                  10
MLZ:Mamm         expert                  7
MLZ:Mamm         geographic distribution            12
MLZ:Mamm         legacy               2157
MLZ:Mamm         revised taxonomy               25
MSB:Bird         ID of kin                   7
MSB:Bird         expert              13823
MSB:Bird         field                  83
MSB:Bird         legacy              24770
MSB:Bird         molecular data              4
MSB:Bird         revised taxonomy               30
MSB:Bird         student                  6165
MSB:Bird         type specimen              10
MSB:Fish         ID to species group          4648
MSB:Fish         expert              60797
MSB:Fish         field               31708
MSB:Fish         legacy               3122
MSB:Fish         molecular data              1
MSB:Fish         published referral              1
MSB:Fish         revised taxonomy               30
MSB:Fish         student                  3441
MSB:Fish         type specimen               8
MSB:Herp         ID of kin                  20
MSB:Herp         ID to species group             1
MSB:Herp         expert                165
MSB:Herp         field               18873
MSB:Herp         geographic distribution             1
MSB:Herp         legacy              13176
MSB:Herp         revised taxonomy                2
MSB:Herp         student                   102
MSB:Host         ID of kin                  17
MSB:Host         ID to species group             5
MSB:Host         curatorial                  6
MSB:Host         expert                207
MSB:Host         field                3225
MSB:Host         geographic distribution           216
MSB:Host         legacy              22510
MSB:Host         molecular data            247
MSB:Host         published referral              4
MSB:Host         revised taxonomy              753
MSB:Host         student                   168
MSB:Host         type specimen               1
MSB:Mamm         ID of kin                 691
MSB:Mamm         ID to species group             2
MSB:Mamm         curatorial              11815
MSB:Mamm         erroneous citation             45
MSB:Mamm         expert              11063
MSB:Mamm         field               72188
MSB:Mamm         geographic distribution           429
MSB:Mamm         legacy             223114
MSB:Mamm         molecular data           6079
MSB:Mamm         photograph                  7
MSB:Mamm         published referral             15
MSB:Mamm         revised taxonomy            18268
MSB:Mamm         student                 22218
MSB:Mamm         type specimen            4780
MSB:Para         ID of kin                  64
MSB:Para         ID to species group             6
MSB:Para         curatorial                 45
MSB:Para         expert               2494
MSB:Para         field                3492
MSB:Para         legacy              21186
MSB:Para         molecular data            415
MSB:Para         published referral              5
MSB:Para         revised taxonomy               13
MSB:Para         student                  2438
MSB:Para         type specimen              68
MSBObs:Mamm      ID of kin                   3
MVZ:Bird         ID of kin                   4
MVZ:Bird         ID to species group          4019
MVZ:Bird         coarse features                 9
MVZ:Bird         expert               1327
MVZ:Bird         field                3807
MVZ:Bird         geographic distribution         11466
MVZ:Bird         legacy             180916
MVZ:Bird         molecular data             56
MVZ:Bird         published referral             11
MVZ:Bird         revised taxonomy            54185
MVZ:Bird         student                     2
MVZ:Bird         type specimen             482
MVZ:Egg          ID of kin                   1
MVZ:Egg          ID to species group           247
MVZ:Egg          expert                 24
MVZ:Egg          field                 407
MVZ:Egg          geographic distribution          1010
MVZ:Egg          legacy              14386
MVZ:Egg          published referral             10
MVZ:Egg          revised taxonomy             3315
MVZ:Fish         field                   1
MVZ:Herp         ID of kin                  11
MVZ:Herp         ID to species group           182
MVZ:Herp         curatorial                  1
MVZ:Herp         erroneous citation              5
MVZ:Herp         expert              15964
MVZ:Herp         field               51074
MVZ:Herp         fine features               4
MVZ:Herp         geographic distribution          3045
MVZ:Herp         legacy             228836
MVZ:Herp         molecular data            227
MVZ:Herp         photograph                  4
MVZ:Herp         published referral            699
MVZ:Herp         revised taxonomy            35319
MVZ:Herp         student                    18
MVZ:Herp         type specimen            4389
MVZ:Hild         expert                  3
MVZ:Hild         field                  72
MVZ:Hild         legacy                585
MVZ:Hild         revised taxonomy                6
MVZ:Mamm         ID of kin                1466
MVZ:Mamm         ID to species group             1
MVZ:Mamm         curatorial                  3
MVZ:Mamm         erroneous citation              1
MVZ:Mamm         expert              40893
MVZ:Mamm         field               15624
MVZ:Mamm         geographic distribution          3713
MVZ:Mamm         legacy             193130
MVZ:Mamm         molecular data            530
MVZ:Mamm         published referral             89
MVZ:Mamm         revised taxonomy            20240
MVZ:Mamm         student                   858
MVZ:Mamm         type specimen            1844
MVZObs:Bird      ID to species group             3
MVZObs:Bird      expert                  3
MVZObs:Bird      field                4621
MVZObs:Bird      geographic distribution          1565
MVZObs:Bird      legacy                238
MVZObs:Bird      revised taxonomy              524
MVZObs:Herp      expert                  3
MVZObs:Herp      field                  19
MVZObs:Herp      photograph                  6
MVZObs:Herp      type specimen               1
MVZObs:Mamm      field                 101
NBSB:Bird        field                 321
NMU:Bird         ID of kin                   3
NMU:Bird         field                 308
NMU:Bird         legacy                  9
NMU:Mamm         ID of kin                  58
NMU:Mamm         expert                 70
NMU:Mamm         field                1203
NMU:Mamm         geographic distribution            95
NMU:Mamm         legacy               1026
NMU:Mamm         molecular data             87
NMU:Mamm         student                     2
NMU:Para         field                   7
OWU:Bird         curatorial                  2
OWU:Bird         student                    26
OWU:ES           curatorial                  3
OWU:ES           expert                 52
OWU:ES           field                   4
OWU:ES           legacy                297
OWU:ES           revised taxonomy               25
OWU:Fish         curatorial                 16
OWU:Fish         field                   1
OWU:Fish         student                     5
OWU:Mamm         ID of kin                   7
OWU:Mamm         curatorial                  7
OWU:Mamm         field                   1
OWU:Mamm         legacy                153
OWU:Mamm         student                     2
OWU:Para         curatorial                  2
OWU:Para         expert                  3
OWU:Rept         curatorial                  7
OWU:Rept         field                  81
OWU:Rept         legacy                137
UAM:Alg          legacy               4639
UAM:Arc          ID of kin               22859
UAM:Arc          expert               8502
UAM:Arc          field                 301
UAM:Arc          legacy             615096
UAM:Arc          student                   558
UAM:Art          expert                422
UAM:Art          legacy               3873
UAM:Bird         expert               1135
UAM:Bird         field               29489
UAM:Bird         legacy               7055
UAM:Bird         revised taxonomy                2
UAM:Bird         type specimen             156
UAM:EH           ID of kin                 890
UAM:EH           curatorial                 85
UAM:EH           expert                144
UAM:EH           field                   1
UAM:EH           legacy              14653
UAM:EH           photograph                  1
UAM:EH           student                   175
UAM:ES           ID of kin                  39
UAM:ES           erroneous citation              1
UAM:ES           expert              27465
UAM:ES           field               20062
UAM:ES           molecular data             14
UAM:ES           photograph                  1
UAM:ES           published referral              2
UAM:ES           revised taxonomy              292
UAM:ES           student                  7916
UAM:ES           type specimen               7
UAM:Ento         ID of kin                 447
UAM:Ento         ID to species group             1
UAM:Ento         coarse features                13
UAM:Ento         curatorial                753
UAM:Ento         erroneous citation             45
UAM:Ento         expert             279626
UAM:Ento         field                3018
UAM:Ento         geographic distribution           783
UAM:Ento         legacy              10861
UAM:Ento         molecular                   2
UAM:Ento         molecular data           1241
UAM:Ento         photograph                 47
UAM:Ento         published referral             38
UAM:Ento         revised taxonomy            21669
UAM:Ento         student                236131
UAM:Ento         type specimen             103
UAM:Env          student                   334
UAM:Fish         ID of kin                  25
UAM:Fish         expert               1980
UAM:Fish         field                1422
UAM:Fish         geographic distribution             4
UAM:Fish         legacy               6841
UAM:Fish         molecular data             20
UAM:Fish         revised taxonomy              472
UAM:Fish         student                    40
UAM:Fish         type specimen               3
UAM:Herb         ID of kin                 146
UAM:Herb         curatorial                  1
UAM:Herb         erroneous citation             10
UAM:Herb         expert              15290
UAM:Herb         field                2173
UAM:Herb         geographic distribution             3
UAM:Herb         legacy             201740
UAM:Herb         molecular data             24
UAM:Herb         revised taxonomy             2267
UAM:Herb         student                  9338
UAM:Herb         type specimen             515
UAM:Herp         ID of kin                   1
UAM:Herp         expert                 21
UAM:Herp         field                 383
UAM:Herp         geographic distribution             2
UAM:Herp         legacy                 35
UAM:Herp         molecular data              2
UAM:Herp         revised taxonomy              183
UAM:Herp         student                    44
UAM:Herp         type specimen               2
UAM:Inv          ID of kin                   9
UAM:Inv          expert                 89
UAM:Inv          field                 306
UAM:Inv          geographic distribution             4
UAM:Inv          legacy              19832
UAM:Inv          revised taxonomy                8
UAM:Inv          student                    56
UAM:Inv          type specimen              21
UAM:Mamm         ID of kin                 396
UAM:Mamm         curatorial                219
UAM:Mamm         erroneous citation              1
UAM:Mamm         expert               2867
UAM:Mamm         field               46898
UAM:Mamm         geographic distribution           655
UAM:Mamm         legacy              62850
UAM:Mamm         molecular data            654
UAM:Mamm         published referral              3
UAM:Mamm         revised taxonomy            20784
UAM:Mamm         student                 29046
UAM:Mamm         type specimen            6710
UAMObs:Bird      field                 164
UAMObs:Ento      ID of kin                  12
UAMObs:Ento      ID to species group             1
UAMObs:Ento      erroneous citation             22
UAMObs:Ento      expert              14216
UAMObs:Ento      field                  14
UAMObs:Ento      geographic distribution           129
UAMObs:Ento      legacy              18638
UAMObs:Ento      molecular                   1
UAMObs:Ento      molecular data           3168
UAMObs:Ento      photograph                 30
UAMObs:Ento      published referral           1981
UAMObs:Ento      revised taxonomy             1982
UAMObs:Ento      student                  1768
UAMObs:Ento      type specimen            1353
UAMObs:Fish      field                   6
UAMObs:Fish      photograph                  7
UAMObs:Mamm      ID of kin                   1
UAMObs:Mamm      curatorial                  1
UAMObs:Mamm      field                 212
UAMObs:Mamm      photograph                 20
UAMObs:Mamm      student                    20
UAMObs:Mamm      type specimen               7
UAMb:Herb        expert              11160
UAMb:Herb        field                   9
UAMb:Herb        geographic distribution             1
UAMb:Herb        legacy              52444
UAMb:Herb        photograph                  1
UAMb:Herb        student                   208
UCM:Bird         ID of kin                   1
UCM:Bird         curatorial                  3
UCM:Bird         erroneous citation              1
UCM:Bird         expert               1073
UCM:Bird         field                   2
UCM:Bird         legacy              11122
UCM:Bird         revised taxonomy               35
UCM:Bird         student                     1
UCM:Bird         type specimen               8
UCM:Egg          ID of kin                   1
UCM:Egg          expert                760
UCM:Egg          legacy               2436
UCM:Egg          revised taxonomy               18
UCM:Egg          student                   153
UCM:Fish         ID of kin                   5
UCM:Fish         ID to species group             1
UCM:Fish         expert                294
UCM:Fish         field                   1
UCM:Fish         legacy               2107
UCM:Fish         revised taxonomy               23
UCM:Fish         student                    11
UCM:Fish         type specimen              25
UCM:Herp         curatorial                 80
UCM:Herp         expert               3349
UCM:Herp         field                   1
UCM:Herp         geographic distribution           413
UCM:Herp         legacy              63160
UCM:Herp         photograph                  1
UCM:Herp         published referral             15
UCM:Herp         revised taxonomy            25751
UCM:Herp         student                     1
UCM:Herp         type specimen             510
UCM:Mamm         expert               1476
UCM:Mamm         field                  13
UCM:Mamm         geographic distribution            11
UCM:Mamm         legacy              19077
UCM:Mamm         molecular data            106
UCM:Mamm         published referral              3
UCM:Mamm         revised taxonomy              223
UCM:Mamm         student                    11
UCM:Mamm         type specimen              28
UCM:Obs          expert                 26
UCM:Obs          field                 205
UCM:Obs          photograph                 20
UCM:Obs          revised taxonomy               36
UCSC:Bird        legacy                175
UCSC:Herp        legacy                  1
UCSC:Mamm        field                 173
UCSC:Mamm        legacy                 24
UMNH:Bird        curatorial                 46
UMNH:Bird        expert                  5
UMNH:Bird        field                  41
UMNH:Bird        legacy              20741
UMNH:Bird        revised taxonomy                3
UMNH:Herp        field                   4
UMNH:Herp        legacy              22431
UMNH:Mamm        ID of kin                  52
UMNH:Mamm        curatorial                354
UMNH:Mamm        expert                 71
UMNH:Mamm        field                3055
UMNH:Mamm        geographic distribution            35
UMNH:Mamm        legacy              40169
UMNH:Mamm        published referral              1
UMNH:Mamm        revised taxonomy              133
UMNH:Mamm        student                     6
UMNH:Mamm        type specimen              18
UMZM:Bird        legacy               6161
UMZM:Mamm        legacy              15830
UNM:ES           ID of kin                   2
UNM:ES           expert                603
UNM:ES           legacy                205
UNM:ES           revised taxonomy               72
UNM:ES           student                   768
UNR:Bird         field                 198
UNR:Fish         ID of kin                  32
UNR:Fish         expert                 66
UNR:Fish         field                3028
UNR:Fish         legacy                205
UNR:Fish         student                   484
UNR:Herp         ID of kin                   3
UNR:Herp         expert                  7
UNR:Herp         field                3589
UNR:Herp         student                     1
UNR:Mamm         ID of kin                   1
UNR:Mamm         field                2016
USNPC:Para       expert                809
USNPC:Para       legacy                  1
USNPC:Para       type specimen               1
UTEP:Arc         expert                451
UTEP:Bird        expert                631
UTEP:Bird        legacy               2426
UTEP:Bird        student                    22
UTEP:ES          ID of kin                 604
UTEP:ES          erroneous citation              3
UTEP:ES          expert              23715
UTEP:ES          legacy                276
UTEP:ES          molecular data              1
UTEP:ES          revised taxonomy              138
UTEP:ES          student                    35
UTEP:ES          type specimen              18
UTEP:Ento        curatorial                  5
UTEP:Ento        expert                530
UTEP:Ento        legacy               7828
UTEP:Ento        student                    32
UTEP:Ento        type specimen              40
UTEP:Fish        expert                 93
UTEP:Fish        field                   4
UTEP:Fish        legacy                  1
UTEP:Fish        student                   147
UTEP:Herb        curatorial                 44
UTEP:Herb        erroneous citation              1
UTEP:Herb        expert              49407
UTEP:Herb        field                 104
UTEP:Herb        legacy              34787
UTEP:Herb        molecular data             47
UTEP:Herb        student                    36
UTEP:Herb        type specimen              15
UTEP:Herp        ID of kin                   4
UTEP:Herp        curatorial                613
UTEP:Herp        expert              10223
UTEP:Herp        field                 113
UTEP:Herp        geographic distribution            37
UTEP:Herp        legacy              11595
UTEP:Herp        molecular data            145
UTEP:Herp        revised taxonomy               24
UTEP:Herp        student                    11
UTEP:Herp        type specimen             336
UTEP:HerpOS      expert                572
UTEP:HerpOS      geographic distribution             2
UTEP:HerpOS      legacy               1166
UTEP:HerpOS      type specimen               1
UTEP:Inv         expert              10207
UTEP:Inv         legacy               1697
UTEP:Inv         type specimen              21
UTEP:Mamm        ID of kin                  20
UTEP:Mamm        expert               8517
UTEP:Mamm        geographic distribution            39
UTEP:Mamm        revised taxonomy                2
UTEP:Teach       expert                 62
UTEP:Teach       field                   1
UTEP:Teach       legacy                180
UTEP:Teach       revised taxonomy                2
UTEP:Teach       student                     2
UTEP:Zoo         expert                 17
UTEP:Zoo         student                   107
UTEP:Zoo         type specimen              27
UTEPObs:Ento         expert                  2
UTEPObs:Herp         ID of kin                   1
UTEPObs:Herp         expert                 75
UTEPObs:Herp         field                  20
UTEPObs:Herp         legacy                 73
UTEPObs:Herp         photograph                 22
UWBM:Herp        ID of kin                 897
UWBM:Herp        expert                568
UWBM:Herp        field                1141
UWBM:Herp        geographic distribution             2
UWBM:Herp        legacy               7754
UWBM:Herp        revised taxonomy               89
UWBM:Herp        student                   164
UWBM:Herp        type specimen               2
UWBM:Mamm        ID of kin                   3
UWBM:Mamm        ID to species group             2
UWBM:Mamm        expert                 14
UWBM:Mamm        field                   4
UWBM:Mamm        legacy              55660
UWYMV:Bird       ID of kin                   1
UWYMV:Bird       curatorial                 12
UWYMV:Bird       expert               2217
UWYMV:Bird       field                  52
UWYMV:Bird       geographic distribution           122
UWYMV:Bird       legacy                  3
UWYMV:Bird       student                    30
UWYMV:Egg        geographic distribution             1
UWYMV:Egg        student                     1
UWYMV:Fish       expert                539
UWYMV:Herp       curatorial                  2
UWYMV:Herp       expert                346
UWYMV:Herp       field                   1
UWYMV:Herp       legacy                  3
UWYMV:Mamm       curatorial                 19
UWYMV:Mamm       expert                  1
UWYMV:Mamm       field                   4
UWYMV:Mamm       geographic distribution          1112
UWYMV:Mamm       legacy               5222
WNMU:Bird        legacy               1339
WNMU:Fish        expert                 55
WNMU:Fish        legacy                324
WNMU:Mamm        expert                 16
WNMU:Mamm        legacy               6542
WNMU:Mamm        revised taxonomy              332

Find your types in USAGE!

Dusty will implement the migration outlined in Mariel's last comment in two weeks-- complain before then or make changes accordingly.

Seems Mariel didn't catch the discussion on this:

revised taxonomy --> taxonomic revision - CAN I DO THIS NOW? yes

The conclusions was NO, this is not going to happen. All 'taxonomic
revision' will become 'revised taxonomy', not the other way around.

-Derek

On Thu, Oct 3, 2019 at 11:07 AM Michelle Koo notifications@github.com
wrote:

Find your types in USAGE!

Dusty will implement the migration outlined in Mariel's last comment in
two weeks-- complain before then or make changes accordingly.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMZ4TYKRYI4UWY5LAQLQMY7IBA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAJH6GY#issuecomment-538083099,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM2YYHLX2E6Z7GCKPNDQMY7IBANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Revised taxonomy is done.

Current values will be preserved in remarks.

Relevant code tables are https://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID and https://arctos.database.museum/info/ctDocumentation.cfm?table=CTIDENTIFICATION_CONFIDENCE

These will happen ~today unless someone stops me:

  • legacy --> unknown
  • molecular data--> molecular

These will happen for anything that hasn't been handled about Oct 16th 2019. PLEASE comment here ASAP if you'd like a different migration path (eg, one that does something with 'confidence') for some or all of your collection.

  • ID of kin --> relationship
  • ID to species group --> unknown
  • curatorial --> FINE FEATURES
  • erroneous citation --> unknown
  • expert -->FINE FEATURES
  • field --> COARSE FEATURES
  • photograph --> COARSE FEATURES
  • published referral --> unknown
  • student --> COARSE FEATURES
  • type specimen --> FINE FEATURES

Good.

For UAM:Ento, UAMObs:Ento, KWP:Ento, KNWR:Ento, and KNWR:Inv

I would like confidence set to 'high' for any identifications with nature
of ID = expert
&
I would like confidence set to 'medium' for any identifications with nature
of ID = student

I am not sure about this migration:

expert -->FINE FEATURES
student --> COARSE FEATURES

I think this is incorrect - lots of student IDs are based on 'fine
features'; similar problem with photograph... plenty of IDs based on photos
are of fine features (eg SEM or any photos taken through microscopes...)

I'm a little worried that we'll be forced to make distinctions between FINE
& COARSE that are over-precise and subjective. It would be good to have a
3rd option "PHENOTYPE" or "MORPHOLOGY" or perhaps "EXTERNAL FEATURES" for
cases where we don't know if the ID was based on FINE or COARSE features.
Sorry if this sets us back a bit.

-Derek

On Thu, Oct 3, 2019 at 11:54 AM dustymc notifications@github.com wrote:

Revised taxonomy is done.

Current values will be preserved in remarks.

Relevant code tables are
https://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID
and
https://arctos.database.museum/info/ctDocumentation.cfm?table=CTIDENTIFICATION_CONFIDENCE

These will happen ~today unless someone stops me:

  • legacy --> unknown
  • molecular data--> molecular

These will happen for anything that hasn't been handled about Oct 16th

  1. PLEASE comment here ASAP if you'd like a different migration path
    (eg, one that does something with 'confidence') for some or all of your
    collection.
  • ID of kin --> relationship
  • ID to species group --> unknown
  • curatorial --> FINE FEATURES
  • erroneous citation --> unknown
  • expert -->FINE FEATURES
  • field --> COARSE FEATURES
  • photograph --> COARSE FEATURES
  • published referral --> unknown
  • student --> COARSE FEATURES
  • type specimen --> FINE FEATURES

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM56XRULEPTPDMKYG7TQMZEYVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAJMUYY#issuecomment-538102371,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMYJ4ZD723C4IEQLQE3QMZEYVANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I don't disagree; my "morphology" proposal died ^^ way up there ^^ somewhere.

I'm not sure anything like SEM is required to get at "fine" - think keying out blue vs. fin whale skulls.

I don't like "external" - think "this adult bear femur is obviously a grizzly because it looks like one."

Given the existing overlap, another hopefully-temporary morphology-or-whatever(==we don't know what "fine" and "course" mean so we don't know if this should be "fine" or "course") option would not bother me too much, particularly if it lets us proceed.

Alternatively, I still like "unknown" (https://github.com/ArctosDB/arctos/issues/2170#issuecomment-537631840) for these. I suppose it's a matter of how much we don't know before we admit we don't know....

What about "external phenoype" (looks like moose") vs "anatomy" (internal morphology/structure) vs "microscopy"?

external phenoype

I think I like it.

"anatomy" vs "microscopy"

Are these different or just taxon-specific things? Folks who look at grasshopper genitalia under a microscope are doing about the same things (probably, I think) as those who look at large mammal femurs, it's just that the latter don't require magnification.

I think there's a place for "took 600 detailed measurements of this sea lion skull from the CT scan" - maybe "microscopy" works for that and we just need to be careful in the definition?

I think morphology fits all these. If I look at mouse teeth, is that
external or internal?

On Thu, Oct 3, 2019 at 4:54 PM dustymc notifications@github.com wrote:

external phenoype

I think I like it.

"anatomy" vs "microscopy"

Are these different or just taxon-specific things? Folks who look at
grasshopper genitalia under a microscope are doing about the same things
(probably, I think) as those who look at large mammal femurs, it's just
that the latter don't require magnification.

I think there's a place for "took 600 detailed measurements of this sea
lion skull from the CT scan" - maybe "microscopy" works for that and we
just need to be careful in the definition?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBGZZ3UXEGH5FVBV4KTQMZZYNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAJ2VDA#issuecomment-538159756,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBG7HFRIJDTRNXUKHODQMZZYNANCNFSM4IEIGLOA
.

I think external phenotype is better for a general 'looks like a moose' or "looks like a certain subspecies of a bird" scenario. Morphology doesn't really cover looking at plumage characteristics, for example. Definition: "Identification based on a coarse examination of external features including size, shape, and color."

I think anatomy would cover things like looking at teeth, bones, appendages, etc. - whether under a scope or not. Definition: "Identification based on a detailed look at structure such as teeth, bones, appendages, etc. whether under a microscope or not."

I was trying to figure out another word for 'microscopy' that would cover things like CT scans or something that requires specialized equipment. "Microscopy" probably isn't the best term. What about "Advanced Imaging" - Definition: "Identification based on advanced imaging technology, including CT scanning, Scanning Electron Microscopy (SEM), etc."

So these three terms would cover progressively more advanced, detailed examination of features for identification.

I'm still not quite seeing the split between anatomy and "microscopy." https://arctos.database.museum/guid/UAM:Mamm:11507 was done with calipers. There's a grant pending to do the same(ish) thing with CT/digital measuring. I don't think the results will be fundamentally different, so I don't think the equipment is a useful split.

Does "detailed measurements" (however we say it) work for the last category? So LOOKING (and maybe making some casual measurements), whether it's at a blue whale or a virus, is "anatomy," but carefully measuring characteristics is something else.

I think that works for more than morphology as well - quantitative colors, maybe some of the stuff that happens around bat detectors, testing artwork for caesium-137 (maybe that's molecular??), etc.

What about "general phenotype" for field or "casual" IDs (could include casual measurements, scale counts). I like the term phenotype because most field ids are based on that.

I agree we need something for more careful analysis, could be scans or quantitative color analysis etc. Something more advanced that contrasts with "general"

I like the wording of General Phenotype.
But since we are really just trying to get back to how we ID things in the
field (which are possibly by an expert but checked quickly with only some
or no reference material available), why can't we keep "field" as one
possibility?
"general phenotype" can be another.
It conveys a lot on how the animal or plant, etc was identified. Whereas
all the others our more specific about tools used, etc.

On Fri, Oct 4, 2019 at 9:28 AM Carla Cicero notifications@github.com
wrote:

What about "general phenotype" for field or "casual" IDs (could include
casual measurements, scale counts). I like the term phenotype because most
field ids are based on that.

I agree we need something for more careful analysis, could be scans or
quantitative color analysis etc. Something more advanced that contrasts
with "general"

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ABCJF4JJQQA2JQTEWOXGVGTQM5VJHA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMGDWY#issuecomment-538468827,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ABCJF4LZ4WZIXB5ET2AOUO3QM5VJHANCNFSM4IEIGLOA
.

--
Carol L. Spencer, Ph.D.
Staff Curator of Herpetology & Researcher
Museum of Vertebrate Zoology
3101 Valley Life Sciences Building
University of California, Berkeley, CA, USA 94720-3160
[email protected] or [email protected]
510-643-5778
http://mvz.berkeley.edu/

Don't we have to consider cultural objects? Maybe this is why we settled on
'features' initially?

I think features is fine I just feel we need an alternative for when we
don't know if the examination was of coarse or fine features...

maybe 'feature examination' ? pretty vague but works & is better than
'unknown'

-Derek

On Fri, Oct 4, 2019 at 8:28 AM Carla Cicero notifications@github.com
wrote:

What about "general phenotype" for field or "casual" IDs (could include
casual measurements, scale counts). I like the term phenotype because most
field ids are based on that.

I agree we need something for more careful analysis, could be scans or
quantitative color analysis etc. Something more advanced that contrasts
with "general"

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM5ASUCXW6MDNS2HMHLQM5VJHA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMGDWY#issuecomment-538468827,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM4UNEXAUY2ZC6IPYD3QM5VJHANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I don't think "phenotype" works for all of Arctos. "This looks like a Clovis point" and "the 800 measurements that fell out of the CT analysis support this being a Clovis point" should work alongside biological material.

The objections yesterday arose around the split between "general" and "detailed" (whatever we ultimately call them). It would be good if the terms themselves included/excluded "sorta-detailed" things like scale count, but that can probably be in the definition as well.

field

I think that would fail to support what we're trying to do here. If we can get at HOW an ID was made, then people can search for cryptospecies (eg, things that look the same under some technique) and such. Saying where it was made (and I think "field" would be used for that, however it's defined) can't do that; people do drag microscopes to the field.

"Expertness" should be derived from the identifier's agent record and the new "confidence" field.

'feature examination' ? pretty vague but works & is better than 'unknown'

I think this is confounding some stuff.

  1. We should have clear terminology going forward; if you {whatever} then you need to use term {something}.
  2. We need a migration path. I don't think there's a universal landing spot for the stuff currently in nature of ID (that's the problem we're trying to fix). Expert, student, curatorial, etc., all could have come from any technique. From here, at the scope of Arctos, I have absolutely no way to say more; I just don't know, and that's what "unknown" says. You may know more - 'experts' in your collection always use 'feature examination' or something. If you have that (or are willing to guess - the existing data will be preserved in remarks, I don't think this has to be perfect) PLEASE let me know so I can write custom migration scripts and move those records now, so they're not caught up in the default/fallback migration.

I don't think "phenotype" works for all of Arctos.

Please no biologically-specific terms. "Features" was a much more inclusive term for all collections held in Arctos.

  • I think we need a 'general features' term, and it should be easy to adjust the name as long as we don't significantly alter the concept.
  • I think we need a 'fine features' term, and it should be easy to adjust the name as long as we don't significantly alter the concept.
  • Do we also need a "features" term that encompasses both of the above? I think this is the current blocker.

I second the request for a term that is inclusive of non-biological
collections.

On Fri, Oct 4, 2019 at 9:29 AM dustymc notifications@github.com wrote:

>

  • I think we need a 'general features' term, and it should be easy to
    adjust the name as long as we don't significantly alter the concept.
  • I think we need a 'fine features' term, and it should be easy to
    adjust the name as long as we don't significantly alter the concept.
  • Do we also need a "features" term that encompasses both of the
    above? I think this is the current blocker.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AJKSRR5T2DAYPTUGWBVQYT3QM54ONA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMLFTY#issuecomment-538489551,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AJKSRR6CD7H6SLNPM2C5P5TQM54ONANCNFSM4IEIGLOA
.

--
Mareca Guthrie
Curator of Fine Arts & Associate Professor of Art
University of Alaska Museum of the North
1962 Yukon Drive
P.O. Box 756960
Fairbanks, AK 99775-6960
[email protected]

University of Alaska Museum of the North: www.uaf.edu/museum
UAF Art Department: https://www.uaf.edu/art/ https://www.uaf.edu/art/
Colors of Nature: http://www.colorsofnature.org/

Do we also need a "features" term that encompasses both of the above? I
think this is the current blocker.

yes - and for migration it's much safer and more appropriate to use this
'features' without specifying fine or coarse for nature of ID = expert and
student, even for 'field' because all such IDs sometimes use coarse and
sometimes use fine features. We cannot assume that experts use fine and
students use coarse. Experts can look at a moose femur in the woods and
know it's moose without a microscope, students can use microscopes etc.

-Derek

On Fri, Oct 4, 2019 at 9:29 AM dustymc notifications@github.com wrote:

>

  • I think we need a 'general features' term, and it should be easy to
    adjust the name as long as we don't significantly alter the concept.
  • I think we need a 'fine features' term, and it should be easy to
    adjust the name as long as we don't significantly alter the concept.
  • Do we also need a "features" term that encompasses both of the
    above? I think this is the current blocker.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM36PO6LY6HTQLSBQJLQM54OLA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMLFTY#issuecomment-538489551,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM5FCQRMUKVGLYSA72TQM54OLANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Done.

features - Identification based on examination of diagnostic traits that may include qualitiative assessment of morphology, coloration, structure, etc. Examination of features may be direct (specimen) or indirect (e.g., photograph). This value encompasses "fine features" and "coarse features" and should not be used if more specific information is available.

But I can't use 'features' for new IDs - getting that legacy terms error

On Fri, Oct 4, 2019 at 10:14 AM dustymc notifications@github.com wrote:

Done.

features - Identification based on examination of diagnostic traits that
may include qualitiative assessment of morphology, coloration, structure,
etc. Examination of features may be direct (specimen) or indirect (e.g.,
photograph). This value encompasses "fine features" and "coarse features"
and should not be used if more specific information is available.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM2NOFZZFWHHOM3ZAXDQM6BZTA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMPJKA#issuecomment-538506408,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUM3FCY62A5LZ6KHS73DQM6BZTANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

You can now - one of the many reasons I don't want to let this hang out in never-never land for too long....

So are we down to just one term of "features"?

I thin it will be confusing to have three "features" term - general, fine, no modifier. I am fine with just the one term, to distinguish between that and molecular, audio-visual, relationships, etc

Dusty, can you provide the most current list and migration path?

No, we are up to three.

At least in theory I think we need more than one, but if there's no compelling immediate need I'm happy enough to drop 'fine' and 'course' and deal with that when someone needs it.

I don't think this changes anything about my migration path; there's still nothing about eg, 'expert' that might lead me to 'features' so I see no alternative but to map it to 'unknown.' I continue hoping that collections WILL have that information (or be willing to make educated guesses at it) and will direct me to better pathways before I have to fall back to shoving most everything into 'unknown' (and remarks). I think ID of kin --> relationship is currently the only unambiguous not-unknown path I have.

no, there are 3 terms:

fine features
coarse features
features

the last one is only to be used when one can't be sure on which of the
first two to apply

On Fri, Oct 4, 2019 at 11:06 AM Carla Cicero notifications@github.com
wrote:

So are we down to just one term of "features"?

I thin it will be confusing to have three "features" term - general, fine,
no modifier. I am fine with just the one term, to distinguish between that
and molecular, audio-visual, relationships, etc

Dusty, can you provide the most current list and migration path?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMZYY7UH4LU44Q6ZU63QM6H43A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAMTS6Y#issuecomment-538524027,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMYWCEBA4CUW5S2EYLTQM6H43ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

In thinking more about this, what are we trying to accomplish by having three terms for essentially the same type of ID, just with more or less detail? I think that's going to cause confusion. That would be like breaking 'molecular' into different types of analysis (genomics, mtDNA sequences, etc.).

Can't we just have one term for 'features' - someone ID'd it based on external or internal features such as size, shape, color, etc. - and then deal with the details if someone wants in other fields, either ID_Remarks or attributes (measurements, color...) plus our new confidence score?

I think we need to step back a bit and ask what our goals are.

accomplish

In theory, I'm thinking

  • "someone looked at large-scale morphology, this could very well be a morphological cryptospecies, we might detect that with more detailed morphology."
  • same thing, but 'detailed morphology' --> genetics-or-something

I'm not sure anyone's ever tried to find specimens in that way (they've certainly failed if they tried) so as long as we're not losing data - and I don't think it's currently there to lose - I am fine with adding that when/if we have the actual need.

https://arctos.database.museum/guid/UAM:Mamm:11507 is an example of a "fine features" specimen, but that's not apparent from the current data, and I don't think we'd lose anything by merging it into 'features.' (And it's not so much "fine features' as 'precise measurements.')

Right. Also, looking at fine features is different than basing an ID on that. You can base the ID on its features, but then examine fine features and add those as other types of data such as attributes, media, etc. For identification, I think the important thing is to distinguish whether it was based on features, molecular data, audio-visual, distribution, etc.

Not to throw a new wrench into the works... but may the most important
thing we're trying to capture is:
FIRST, the confidence or some measure of that (preliminary id vs
verified)-- this is really the Nature of Id (quality)
SECOND, the method of how that id was made (fine, coarse, or external
(trying to get at something less biological than phenotype) vs molecular),
etc etc (more quantitative although the method may be based on quality or
gestalt or whatever, for ex., if Patton says he "feels" it's a particular
subspecies of gopher I'd have more confidence than if anyone else did)

If we think of it that way, then maybe we dont have to parse/ dissect /
enumerate terms like "Fine" which just is giving me hives

On Mon, Oct 7, 2019 at 8:38 AM Carla Cicero notifications@github.com
wrote:

Right. Also, looking at fine features is different than basing an ID on
that. You can base the ID on its features, but then examine fine features
and add those as other types of data such as attributes, media, etc. For
identification, I think the important thing is to distinguish whether it
was based on features, molecular data, audio-visual, distribution, etc.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UOHQJJ2PCN5LXRRHPLQNNJVVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAQZXDI#issuecomment-539073421,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AATH7UP7C3XSHDJFRHQHWZ3QNNJVVANCNFSM4IEIGLOA
.

@mkoo I think those things are all closely related, and that's part of what we're doing here.

  • Patton thinks it looks like a gopher based on some technique (nature_of_id) he's familiar with
  • Patton thinks it looks like a gopher based on some technique he's familiar with, but notes that the diagnostic features are mangled
  • Patton thinks it looks like a gopher based on some technique that he's never used before
  • Some student who came to know gophers exists about 5 minutes ago does the same things....

I think you need all of that to understand an identification. "Patton is an expert (and he may or may not have been working in familiar territory or with identifiable specimens)" doesn't say much.

yes, what I'm saying is:
nature of id= validated or high confidence or verified or whatever term we
like
method of id= Patton ran allozymes.

user= feels good that it's gopher X

or
nature of id= preliminary or low confidence or unvalidated
method of id= some student looked at external morphology while wrangling
out of its trap (i.e., "field")

user= hmm, (data) buyer beware

On Mon, Oct 7, 2019 at 9:00 AM dustymc notifications@github.com wrote:

@mkoo https://github.com/mkoo I think those things are all closely
related, and that's part of what we're doing here.

  • Patton thinks it looks like a gopher based on some technique
    (nature_of_id) he's familiar with
  • Patton thinks it looks like a gopher based on some technique he's
    familiar with, but notes that the diagnostic features are mangled
  • Patton thinks it looks like a gopher based on some technique that
    he's never used before
  • Some student who came to know gophers exists about 5 minutes ago
    does the same things....

I think you need all of that to understand an identification. "Patton is
an expert (and he may or may not have been working in familiar territory or
with identifiable specimens)" doesn't say much.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UL6JKMAQK2VEHN2FUTQNNMLJA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAQ4LHI#issuecomment-539084189,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AATH7UL2GT6KDC42XUAVWS3QNNMLJANCNFSM4IEIGLOA
.

That's what we're doing, but not with that vocabulary.

method ==> nature_of_id
validated or high confidence or verified or whatever term we like ==> identification_confidence

yea I'm saying flip that! we see the nature of id first and it's a quality
measurement not the method that's primary. Currently, we are trying to
implicitly derive the quality from the method which is not efficient and
where we get lost in the fine vs coarse dissections.

On Mon, Oct 7, 2019 at 9:36 AM dustymc notifications@github.com wrote:

That's what we're doing, but not with that vocabulary.

method ==> nature_of_id
validated or high confidence or verified or whatever term we like ==>
identification_confidence

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UPBZMRAHN42EZWRPPTQNNQPNA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAQ74UA#issuecomment-539098704,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AATH7UKLZTZ3ATKYFBVIAP3QNNQPNANCNFSM4IEIGLOA
.

  1. I don't think that's a blocker - we can flip labels whenever we want.
  2. Treating an explicitly-subjective NULLable field that many collections have said they won't use as "primary" doesn't make sense to me.

No, we would alter the crosswalk so it's not nullable... I'm talking about
a conceptual "flip" and moving our model of nature of id so that it better
captures confidence quality. Because that is implicitly what we are trying
to do with things like nature of id="field" vs "expert"

perhaps better to have as a priority conversation topic as I feel that
others will have opinions on this who are not part of this thread.

On Mon, Oct 7, 2019 at 9:49 AM dustymc notifications@github.com wrote:

>

  1. I don't think that's a blocker - we can flip labels whenever we
    want.
  2. Treating an explicitly-subjective NULLable field that many
    collections have said they won't use as "primary" doesn't make sense to me.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UL2JN3SPRV7SWSWKMTQNNSCRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEARBHLI#issuecomment-539104173,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AATH7UP2N6DTTGN4YDHLUA3QNNSCRANCNFSM4IEIGLOA
.

I feel we're going backwards here.

We added the new field 'ID confidence' which clearly captures what was
mixed with 'id method' in 'nature of ID' - maybe we need to change the name
of the field from 'nature of id' to 'id method' to make it even more
explicit:

ID confidence
ID method

-Derek

On Mon, Oct 7, 2019 at 8:56 AM Michelle Koo notifications@github.com
wrote:

No, we would alter the crosswalk so it's not nullable... I'm talking about
a conceptual "flip" and moving our model of nature of id so that it better
captures confidence quality. Because that is implicitly what we are trying
to do with things like nature of id="field" vs "expert"

perhaps better to have as a priority conversation topic as I feel that
others will have opinions on this who are not part of this thread.

On Mon, Oct 7, 2019 at 9:49 AM dustymc notifications@github.com wrote:

>

  1. I don't think that's a blocker - we can flip labels whenever we
    want.
  2. Treating an explicitly-subjective NULLable field that many
    collections have said they won't use as "primary" doesn't make sense to
    me.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UL2JN3SPRV7SWSWKMTQNNSCRA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEARBHLI#issuecomment-539104173
,
or mute the thread
<
https://github.com/notifications/unsubscribe-auth/AATH7UP2N6DTTGN4YDHLUA3QNNSCRANCNFSM4IEIGLOA

.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM3CIP6Z6MMXKNONDRTQNNS35A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEARB4LA#issuecomment-539106860,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ACFNUMZTNRNF3EEYNUCX3XTQNNS35ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

backwards

Certainly somewhere we've been before.

method

That's just labels - we can go there without getting stuck here.

so maybe ID confidence is the new nature of id-- put that at top
ID method is a new field with the nature of id values with all the
variation that you need.

this would map to DwC identificationVerificationStatus and
identificationRemarks respectively, right?

On Mon, Oct 7, 2019 at 10:49 AM dustymc notifications@github.com wrote:

backwards

Certainly somewhere we've been before.

method

That's just labels - we can go there without getting stuck here.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AATH7UM4OG2372H4MIMQH73QNNZBDA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEARHAAI#issuecomment-539127809,
or mute the thread
https://github.com/notifications/unsubscribe-auth/AATH7UNTWTZSABSPFVQUYVTQNNZBDANCNFSM4IEIGLOA
.

ID confidence is the new nature of id

But it's not! One can be confident (or not) of IDs of any nature.

identificationVerificationStatus

That's basically a term for a simpler model. We "verify" by adding identifications. I suppose I could somehow stuff something that means "28 people agree on this" (http://arctos.database.museum/guid/MSB:Mamm:55245) in there, but that's not very meaningful without https://github.com/ArctosDB/arctos/issues/2267 - spelling something the same way doesn't necessarily mean you think it's the same thing, and vice-versa.

I like Derek's suggestion of ID confidence + ID method. For method, we want to know if you looked at features first, then based subspecies on geographic distribution, then sequenced and found it's really some other taxon, etc. For observations, maybe it's based on features (saw a moose) or audio-visual.

Add confidence on top of that.

What about adding a third field for ID determination:
field
museum
lab

method

I think that's usefully in id_sensu (which might need a new name? We should get taxon concepts settled in first) - I think every method worth recording is going to require a publication to describe.

I'm not seeing what value
field
museum
lab
would add??

you looked at features first, then based subspecies on geographic distribution, then sequenced

Those should be three IDs.

When will the "legacy" options stop showing up in the data entry screen pick list?

When I get a migration path and they're no longer used.

Yes, those would be three IDs. But I think it's useful to know whether the ID was made in the field (first attempt, without comparative material or more fine-scaled methods) or not. Maybe just a flag "Is this a field ID" ?

I feel like we need to 'regroup.' From what I can tell, here is the latest version (see google doc):

-- fine features + coarse features ---> _I suggest combining these two into a single term 'features', but this is still being debated_
-- function
-- molecular
-- distribution (do we need 'geographic'?)
-- karyotype
-- relationship
-- audio-visual
-- taxonomic revision
-- unknown

Are we agreed on all of these except for whether to break up 'features' ?

Confidence level added to this:
-- unknown (= default?)
-- low
-- medium
-- high

Is this where we stand now?

"Is this a field ID"

I think that takes up back to where we are - random data that doesn't really do anything - but with a lot more work.

without comparative material
fine-scaled methods

I think the idea that those don't exist in the field is a bit discipline-specific. In any case they may have a place here, but they should be quantified/categorized as what they are, not lumped into whatever "field" might mean.

I don't think we have to agree on 'features' except we should get rid of anything that won't be used. (And I believe that ship sailed a few days ago.)

The default for confidence will be NULL. I'm still pretty baffled as to what purpose "unknown" serves or why it is necessary.

Here's current data and proposed mappings.

UAM@ARCTOS> select nature_of_id,count(*) from identification group by nature_of_id order by nature_of_id;

NATURE_OF_ID COUNT(*)


ID of kin 29755 --------------------------> relationship
ID to species group 9591------------------> unknown
coarse features 55------------------------> none/happy
curatorial 14226----------------------> unknown (or features?)
erroneous citation 156------------------------> unknown
expert 725767----------------------> unknown (or features?)
features 269----------------------------->none/happy
field 352627-----------------------> unknown (or features?)
fine features 75-------------------------->none/happy
geographic distribution 25559------------------->none/happy
molecular 13888------------------------->none/happy
photograph 495-----------------------> unknown (or features?)
published referral 3953-----------------------> unknown
revised taxonomy 258734----------------------->none/happy
student 354338-----------------------> unknown (or features?)
type specimen 23663-----------------------> unknown
unknown 2555259-----------------------> >none/happy

Again, PLEASE give me better mapping, globally or for your collection/parts of your collection.

My suggestions in bold (at least for MVZ Birds, and probably for the other MVZ collections although other staff curators should weigh in @atrox10 @mkoo). I still don't like having three terms for 'features' but I guess that's a curatorial call on how we use it.

ID of kin 29755 --------------------------> relationship
ID to species group 9591------------------> unknown CC: features
coarse features 55------------------------> none/happy
curatorial 14226----------------------> unknown (or features?) CC: features
erroneous citation 156------------------------> unknown
expert 725767----------------------> unknown (or features?) CC: features
features 269----------------------------->none/happy
field 352627-----------------------> unknown (or features?) CC: features
fine features 75-------------------------->none/happy
geographic distribution 25559------------------->none/happy
molecular 13888------------------------->none/happy
photograph 495-----------------------> unknown (or features?) CC: features
published referral 3953-----------------------> unknown
revised taxonomy 258734----------------------->none/happy
student 354338-----------------------> unknown (or features?) CC: features
type specimen 23663-----------------------> unknown CC: features
unknown 2555259-----------------------> >none/happy

I agree with Carla's mapping. I also support just using "features" as a
single term.
Do we have a map already from these terms to confidence level?

On Wed, Oct 9, 2019 at 8:59 AM Carla Cicero notifications@github.com
wrote:

My suggestions in bold (at least for MVZ Birds, and probably for the other
MVZ collections although other staff curators should weigh in @atrox10
https://github.com/atrox10 @mkoo https://github.com/mkoo). I still
don't like having three terms for 'features' but I guess that's a
curatorial call on how we use it.

ID of kin 29755 --------------------------> relationship
ID to species group 9591------------------> unknown CC: features
coarse features 55------------------------> none/happy
curatorial 14226----------------------> unknown (or features?) CC:
features

erroneous citation 156------------------------> unknown
expert 725767----------------------> unknown (or features?) CC: features
features 269----------------------------->none/happy
field 352627-----------------------> unknown (or features?) CC: features
fine features 75-------------------------->none/happy
geographic distribution 25559------------------->none/happy
molecular 13888------------------------->none/happy
photograph 495-----------------------> unknown (or features?) CC:
features

published referral 3953-----------------------> unknown
revised taxonomy 258734----------------------->none/happy
student 354338-----------------------> unknown (or features?) CC:
features

type specimen 23663-----------------------> unknown CC: features
unknown 2555259-----------------------> >none/happy

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBDKNNAILYU5MZSI2OTQNXWTLA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAYF3ZI#issuecomment-540040677,
or mute the thread
https://github.com/notifications/unsubscribe-auth/ADQ7JBHZHQLWJWWM7BCL5J3QNXWTLANCNFSM4IEIGLOA
.

map already from these terms to confidence level

No - I can handle that if you want, but without feedback from you I think I can only map to NULL.

@DerekSikes has that mapped for his collections in https://github.com/ArctosDB/arctos/issues/2170#issuecomment-537634372 - I can extend that to the default if you'd like.

Do we also need a "taxon description" or something confidence value for the types?

The data suggest that value is not being used correctly which makes me a bit hesitant to double down on it, but it's just a vocabulary term at this point.

  type_status,
  count(*)
from
  identification
  left outer join citation on identification.collection_object_id=citation.collection_object_id
where
  nature_of_id='type specimen'
group by
  type_status
order by
  type_status
 13  ;

TYPE_STATUS          COUNT(*)
------------------------------ ----------
basis of illustration             138
erroneous citation             75
holotype                 1667
host voucher                  598
isolectotype                8
isoneotype              2
isosyntype              4
isotype                   100
lectotype                  48
neotype                 7
paralectotype                  34
paratopotype                 1887
paratype                 3694
referral                  239
symbiotype                162
syntype                    72
voucher                 21049
                     2744

Agree with using Derek's as default with the following additions - please review

ID of kin 29755 --------------------------> relationship confidence high
ID to species group 9591------------------> unknown CC: features confidence low
coarse features 55------------------------> none/happy
curatorial 14226----------------------> unknown (or features?) CC: features confidence high
erroneous citation 156------------------------> unknown confidence low
expert 725767----------------------> unknown (or features?) CC: features confidence high
features 269----------------------------->none/happy
field 352627-----------------------> unknown (or features?) CC: features confidence low
fine features 75-------------------------->none/happy
geographic distribution 25559------------------->none/happy confidence medium
molecular 13888------------------------->none/happy confidence high
photograph 495-----------------------> unknown (or features?) CC: features confidence medium
published referral 3953-----------------------> unknown confidence high
revised taxonomy 258734----------------------->none/happy confidence high
student 354338-----------------------> unknown (or features?) CC: features confidence medium
type specimen 23663-----------------------> unknown CC: features confidence high
unknown 2555259-----------------------> >none/happy

I am happy to map whatever you want for your collection, but as global defaults I have some concerns.

ID of kin 29755 --------------------------> relationship confidence high

This seems wrong to me - the confidence comes from the identification of the related specimen.

ID to species group 9591------------------> unknown CC: features confidence low

I've never been sure of the intent here, but I _think_ it would be "we're fairly confident, but to an imprecise taxon." I have no better suggestions....

erroneous citation 156------------------------> unknown confidence low
That one's just weird - "we're sure we're wrong"???

field 352627-----------------------> unknown (or features?) CC: features confidence low

This is somewhat taxon-dependent, which I probably can't get at for the purposes of migration. This seems correct for shrews, maybe not so much for bison.

revised taxonomy 258734----------------------->none/happy confidence high

I think this also depends on previous IDs. I see no reason "Clethrionomys, maybe" should become "Myodes, definitely."

We use ID of kin for Mexican wolves in an endangered species recovery program. Our confidence in the ID is extremely high. I would hope that they aren't accidentally breeding and releasing coyotes . . .

I could go with ID to species group as medium. You know at least genus.

I don't know about erroneous citation - who uses that? It is in the type definition field.

Agree with revised taxonomy depending on previous, but what about legacy? maybe default to medium?
@DerekSikes @jldunnum

One thing on the "ID of kin" values - for a long time, this was (still is?) the default value in data entry rather than leaving the default as NULL.. Can you send a list of ID of kin specimens so we can verify this is really the case? At MSB, my guess is ID of kin should only legitimately be used for Canis lupus baileyi and possibly embryos cataloged separately.
@jldunnum

erroneous citation - who uses that

https://github.com/ArctosDB/arctos/issues/2170#issuecomment-538076627 or...

guid_prefix, count(*) c 
from collection,cataloged_item,identification where 
  4  nature_of_id='erroneous citation' and collection.collection_id=cataloged_item.collection_id and cataloged_item.collection_object_id=identification.collection_object_id group by guid_prefix order by guid_prefix;

GUID_PREFIX                               C
------------------------------------------------------------ ----------
DGR:Bird                                  1
DMNS:Bird                                15
DMNS:Inv                                  2
KNWR:Ento                                 1
KWP:Ento                                  2
MSB:Mamm                                 45
MVZ:Herp                                  5
MVZ:Mamm                                  1
UAM:ES                                    1
UAM:Ento                                 45
UAM:Herb                                 10
UAM:Mamm                                  1
UAMObs:Ento                              22
UCM:Bird                                  1
UTEP:ES                                   3
UTEP:Herb                                 1

16 rows selected.


what about legacy

Sounds like NULL to me; there's no reason to assert anything you can't know.

list

create table temp_nok as select guid, identification.scientific_name,accepted_id_fg from flat,identification where flat.collection_object_id=identification.collection_object_id and identification.nature_of_id='ID of kin';

temp_nok.csv.zip

Going back to @ ccicero

Yes, those would be three IDs. But I think it's useful to know whether the ID was made in the field (first attempt, without comparative material or more fine-scaled methods) or not. Maybe just a flag "Is this a field ID" ?

What about all of the specimens that are IDed in the lab first rather then the field? I make a lot of quick ID calls when specimens are donated as salvage to the museum, and then stuff them in a freezer for future prepping. I'd count those as Field IDs (just not made in the field), but would this be confusing to someone trying to follow?

Good point Beth. We also use(d) 'field' ID for salvaged specimens prepped in the lab because that was the most fitting option, but it was always strange to do so. I think 'features' takes care of both field and lab preps.

Re: confidence levels mappings:

ID of kin 29755 --------------------------> relationship confidence high
---> I think high is appropriate here, contrary to Dusty's comment. If you know that the parent or sibling is species X, then you should be confident that what you are ID'ing is also species X.

ID to species group 9591------------------> unknown CC: features confidence low

---> I agree with Dusty's comment here. "Medium" seems ok to me.

coarse features 55------------------------> none/happy
curatorial 14226----------------------> unknown (or features?) CC: features confidence high

----> not sure what 'curatorial' means, but I think this would depend on who's doing the ID. If it's a student, then confidence is lower than if ID is by a curator. I'm not sure that we've used this for MVZ collections, can you tell me if we have any with this ID, and the identifier?

erroneous citation 156------------------------> unknown confidence low
expert 725767----------------------> unknown (or features?) CC: features confidence high
features 269----------------------------->none/happy
field 352627-----------------------> unknown (or features?) CC: features confidence low

---> Yes, per Dusty's comment that it would be taxon-dependent. But presumably if you only can ID to a genus in the field, then that's the ID that would have been entered. Or if you're not sure of the ID, then you'd add a "?". So can we get at this through taxa_formula?
if formula contains 'sp' or 'ssp' or 'string' or '?' of 'cf' or 'aff'-or 'A or B' --> confidence is low
if formula contains 'A' or 'A and B' ---> confidence is high
if formula contains 'A / B intergrade' or 'A x B' ---> confidence is low?

fine features 75-------------------------->none/happy
geographic distribution 25559------------------->none/happy confidence medium

---> This can be low or high depending on the taxon and locality, so I am ok with medium.

molecular 13888------------------------->none/happy confidence high
photograph 495-----------------------> unknown (or features?) CC: features confidence medium
published referral 3953-----------------------> unknown confidence high
revised taxonomy 258734----------------------->none/happy confidence high
student 354338-----------------------> unknown (or features?) CC: features confidence medium

---> This can be low or medium depending on the student. We don't use this at MVZ, but I guess medium is ok?

type specimen 23663-----------------------> unknown CC: features confidence high
unknown 2555259-----------------------> >none/happy

@ccicero thanks!

I think there's a bunch of stuff confounded at the intersection of nature and formula. There are ~2500 expert+"A ?" IDs, for example, I'm not sure if that's what you mapped or not. Data attached. I very tentatively suggest we don't pursue that as globals - it's fine for MVZ if that's how it's been used there.

create table temp_lcidf as 
  select 
  guid_prefix, 
  taxa_formula,
  nature_of_id,
  count(*) c 
from 
  collection,
  cataloged_item,
  identification 
where 
 collection.collection_id=cataloged_item.collection_id and 
 cataloged_item.collection_object_id=identification.collection_object_id and
 taxa_formula in ('A ?','A sp.','A ssp.','A {string}','A aff.','A cf.','A or B')
group by 
  guid_prefix, 
  taxa_formula,
  nature_of_id
order by 
  guid_prefix, 
  taxa_formula,
  nature_of_id
;

temp_lcidf.csv.zip

There are a few MVZ/student IDs.

MVZ:Bird 2
MVZ:Herp 18
MVZ:Mamm 858

add a "?"

Good point- can we (long after the dust has settled here!) get rid of that formula, or is it somehow different than confidence?

'A or B'

At least some of those are low-precision but not necessarily low-confidence. "It's one of these two almost-identical shrews; I can't tell them apart." I don't think there's any confidence embedded in that. @amgunderson @KyndallH

add a "?"

Good point- can we (long after the dust has settled here!) get rid of that
formula, or is it somehow different than confidence?

I've long thought that there's serious redundancy between an ID with nature
of ID = student and formula = A ?

One pro of keeping A ? is that string gets sent to GBIF etc and is more
visible than the nature of ID information (I'm not even sure that gets
shared - does it?)

But with ID confidence as a field, it seems silly to retain formula A ?
since this would be the same as confidence = low (or possibly medium).

-Derek

On Thu, Oct 10, 2019 at 8:34 AM dustymc notifications@github.com wrote:

@ccicero https://github.com/ccicero thanks!

I think there's a bunch of stuff confounded at the intersection of nature
and formula. There are ~2500 expert+"A ?" IDs, for example, I'm not sure if
that's what you mapped or not. Data attached. I very tentatively suggest we
don't pursue that as globals - it's fine for MVZ if that's how it's been
used there.

create table temp_lcidf as
select
guid_prefix,
taxa_formula,
nature_of_id,
count(*) c
from
collection,
cataloged_item,
identification
where
collection.collection_id=cataloged_item.collection_id and
cataloged_item.collection_object_id=identification.collection_object_id and
taxa_formula in ('A ?','A sp.','A ssp.','A {string}','A aff.','A cf.','A or B')
group by
guid_prefix,
taxa_formula,
nature_of_id
order by
guid_prefix,
taxa_formula,
nature_of_id
;

temp_lcidf.csv.zip
https://github.com/ArctosDB/arctos/files/3713618/temp_lcidf.csv.zip

There are a few MVZ/student IDs.

MVZ:Bird 2
MVZ:Herp 18
MVZ:Mamm 858

add a "?"

Good point- can we (long after the dust has settled here!) get rid of that
formula, or is it somehow different than confidence?

'A or B'

At least some of those are low-precision but not necessarily
low-confidence. "It's one of these two almost-identical shrews; I can't
tell them apart." I don't think there's any confidence embedded in that.
@amgunderson https://github.com/amgunderson @KyndallH
https://github.com/KyndallH

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYZ3FZB4W6JXJLZESTQN5KQLA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEA47IPA#issuecomment-540668988,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM5CXAUEZ2Y4WU57P6DQN5KQLANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Yes, there is some redundancy between ID with "?" and confidence=low, but I favor keeping the "?" because that is passed to data aggregators whereas confidence is/will not. Also, it makes it evident right in the name that the ID is questionable. I'm not sure how confidence will be displayed, but presumably smaller font below the name?

expert + A ? No, I didn't map that specifically, but that implies (to me) that someone who knows something (e..g. a Jim Patton or Peter Pyle, I would call them both experts) still have some questions about ID. So I don't think we can go strictly from nature of ID to the mapping without considering the formula. In fact, I would say that formula might be more informative for mapping than nature, but for some both need to be considered.

I agree re: 'A or B'.

How's this:

if formula contains 'sp' or 'ssp' or 'string' or '?' of 'cf' or 'aff' --> confidence is low
if formula contains 'A or B' ---> confidence is medium
if formula contains 'A' or 'A and B' ---> use suggested 'nature' mappings
if formula contains 'A / B intergrade' or 'A x B' ---> depends on 'nature' - if molecular, probably higher than by features (?)

Can we get this into a google doc somehow (we already have two going, but it might help to start a clean sheet with these different possible combinations to help in mapping)?

passed to data aggregators

I don't think that's ever a reason to model data - we can map WHATEVER.

not sure how confidence will be displayed

That's UI and "easy" (ish...) to change - you can search now.

Screen Shot 2019-10-10 at 1 11 40 PM

Screen Shot 2019-10-10 at 1 12 22 PM

google doc

Good idea - I'm on it.

This is making my life difficult - I am not able to load anything in the bulkloader because "Legacy nature_of_id terms are disallowed" and I have no idea what to enter in nature of ID anymore....

bulkloader

Let me know if you want me to update something there.

what to enter

http://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID

There's the start of a migration document here:

https://docs.google.com/spreadsheets/d/14IOPiv2vHbZv30N3975Y80wf2-_1__Xy1I7ZfXI1FSM/edit?usp=sharing

Please start your review of this in the "first_update" tab. I will make two layers of updates from this if necessary:

  1. I can make smaller-scale updates for specific groups of specimens (collection, institution, whatever). For example, I've mapped that I'll first update all nature_of_id="original_data" to nature_of_id="example" with a confidence of "example" for collection "Fake:Example."
  2. Anything not covered in (1) will be updated for all remaining collections. "type specimen" will become "features" with confidence "high" for any collections that are still using "type specimen" (eg, those that haven't requested a custom update) when I get to this step, for example.

Second, I will update again for everything in the "second_update" tab. So all MVZ specimens with a "A sp." formula are currently mapped to confidence=low, even if they were mapped as confidence=high in the "first_update" tab. If there's agreement on confidence mapping "for_collections" can just be changed to 'all' or similar, if not I will run that step only for those collections that request it.

The existing/old values will still be copied to remarks.

Mapping to aggregators: I agree that we should do what's best for us, but it doesn't hurt to keep the "?" and if we get rid of that and only have confidence, that will likely be buried in a remark somewhere. Given that a lot of requests come through VerNet/GBIF, searches, I think it's worth keeping the "?"

Google doc: I'll look at that as soon as possible, it may be a few days. Thanks.

Sorry I haven't been able to make recent AWG meetings and actually contribute constructively. Instead I'm just sending whiny emails.
A few things:
I know we have lots of IDs which have incorrect nature of ID values generated by our previous DGR collection manager prior to Mariel taking the position. These are primarily the result of inattention to "ID of kin" being set as default or incorrect use of "Type specimen". These remain because we haven't had time to carefully go through them all yet and it is super labor intensive to change them. But I guess this needs to happen quickly now to avoid them being modified and thus harder to track down going forward.

Secondly, I have to say I liked "curatorial" and "field" as they quickly informed that a specimen had either only gone through a provisional first pass in the field or if it had been revisited and examined more closely. I liked being able to see that history and when coupled with ID agent it provided the ID confidence. Now the ID history will appear as "feature" and if looked at again it will have a second "feature", but presumably with a higher confidence value. How will confidence value be displayed? Will it be in the ID history as well?

Overall, I am just very leery about the subjective nature of assigning confidence values (I certainly don't want to assign values to all the legacy IDs which we have no idea on). Even something like "molecular" isn't always going to be high confidence. Just depends on what taxa you included in your analyses or what sequences are available in Genbank when you blasted your sequence.


Jonathan L. Dunnum Ph.D.
Senior Collection Manager
Division of Mammals, Museum of Southwestern Biology
University of New Mexico
Albuquerque, NM 87131
(505) 277-9262
Fax (505) 277-1351

MSB Mammals website: http://www.msb.unm.edu/mammals/index.html
Facebook: http://www.facebook.com/MSBDivisionofMammals

Shipping Address:
Museum of Southwestern Biology
Division of Mammals
University of New Mexico
CERIA Bldg 83, Room 204
Albuquerque, NM 87131


From: dustymc notifications@github.com
Sent: Thursday, October 10, 2019 3:03 PM
To: ArctosDB/arctos arctos@noreply.github.com
Cc: Jonathan Dunnum jldunnum@unm.edu; Mention mention@noreply.github.com
Subject: Re: [ArctosDB/arctos] nature of ID (#2170)

There's the start of a migration document here:

https://docs.google.com/spreadsheets/d/14IOPiv2vHbZv30N3975Y80wf2-_1__Xy1I7ZfXI1FSM/edit?usp=sharing

Please start your review of this in the "first_update" tab. I will make two layers of updates from this if necessary:

  1. I can make smaller-scale updates for specific groups of specimens (collection, institution, whatever). For example, I've mapped that I'll first update all nature_of_id="original_data" to nature_of_id="example" with a confidence of "example" for collection "Fake:Example."
  2. Anything not covered in (1) will be updated for all remaining collections. "type specimen" will become "features" with confidence "high" for any collections that are still using "type specimen" (eg, those that haven't requested a custom update) when I get to this step, for example.

Second, I will update again for everything in the "second_update" tab. So all MVZ specimens with a "A sp." formula are currently mapped to confidence=low, even if they were mapped as confidence=high in the "first_update" tab. If there's agreement on confidence mapping "for_collections" can just be changed to 'all' or similar, if not I will run that step only for those collections that request it.

The existing/old values will still be copied to remarks.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHubhttps://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AED2PAZRWTW3BLDEIRCJMMTQN6KCLA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEA557GY#issuecomment-540794779, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AED2PA653YYGDOZ3BP46VFLQN6KCLANCNFSM4IEIGLOA.

labor intensive

As always, I'm happy to try to help that - I just need some way to find existing things and new values. (Sorry I can't help much with the values!)

track down

It's not ideal, but I will retain the old values in remarks, and I can help you find records etc as necessary.

displayed

See https://github.com/ArctosDB/arctos/issues/2170#issuecomment-540767900

history

Yes, it's part of the ID and will be preserved as such.

leery about the subjective nature of assigning confidence values

Yes, I agree, which is why I initially mapped the default to not bring them in at all. It does seem somehow vaguely useful in distinguishing between eg, curatorial and field (which I'd also classify as "vaguely useful" so I don't think much changes there). It also seems very useful to me in noting specific events on specific specimens - "I'm an expert, this is my taxon, I'm not sure about this particular ID because the diagnostic features are mangled" becomes readily searchable rather than something that would have to be extracted from remarks.

In any case I'm happy to do something special for MSB, revise the defaults for everyone, revisit this later from the "old" nature data preserved in remarks, etc.

OK, I'll think about how best to search for our problem children and get back with you with potential ideas. Thanks!


Jonathan L. Dunnum Ph.D.
Senior Collection Manager
Division of Mammals, Museum of Southwestern Biology
University of New Mexico
Albuquerque, NM 87131
(505) 277-9262
Fax (505) 277-1351

MSB Mammals website: http://www.msb.unm.edu/mammals/index.html
Facebook: http://www.facebook.com/MSBDivisionofMammals

Shipping Address:
Museum of Southwestern Biology
Division of Mammals
University of New Mexico
CERIA Bldg 83, Room 204
Albuquerque, NM 87131


From: dustymc notifications@github.com
Sent: Friday, October 11, 2019 11:42 AM
To: ArctosDB/arctos arctos@noreply.github.com
Cc: Jonathan Dunnum jldunnum@unm.edu; Mention mention@noreply.github.com
Subject: Re: [ArctosDB/arctos] nature of ID (#2170)

labor intensive

As always, I'm happy to try to help that - I just need some way to find existing things and new values. (Sorry I can't help much with the values!)

track down

It's not ideal, but I will retain the old values in remarks, and I can help you find records etc as necessary.

displayed

See #2170 (comment)https://github.com/ArctosDB/arctos/issues/2170#issuecomment-540767900

history

Yes, it's part of the ID and will be preserved as such.

leery about the subjective nature of assigning confidence values

Yes, I agree, which is why I initially mapped the default to not bring them in at all. It does seem somehow vaguely useful in distinguishing between eg, curatorial and field (which I'd also classify as "vaguely useful" so I don't think much changes there). It also seems very useful to me in noting specific events on specific specimens - "I'm an expert, this is my taxon, I'm not sure about this particular ID because the diagnostic features are mangled" becomes readily searchable rather than something that would have to be extracted from remarks.

In any case I'm happy to do something special for MSB, revise the defaults for everyone, revisit this later from the "old" nature data preserved in remarks, etc.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHubhttps://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=AED2PA5E3PKD2F6XACPAHHDQOC3HVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBAWXHQ#issuecomment-541158302, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AED2PA5GTEMNXXHKFFQIQBTQOC3HVANCNFSM4IEIGLOA.

I just thought about another complication with relates to another issue of multiple IDs in data entry.

If you use can only identify a specimen to genus, and use taxon A, then confidence = high
Same scenario but you use A sp, then confidence = low

Similarly, we often identify a specimen to species (formula A) in the field or during preparation, but then identify it to subspecies in the museum based on either features or distribution. Sometimes we can't ID to subspecies, so we use A ssp. We could leave it as taxon A, but assigning 'ssp' suggests that we tried to ID to subspecies but couldn't with confidence so we didn't. In this scenario:
taxon A = confidence high
taxon A ssp. = confidence low

So it seems to me that you can assign different confidence levels depending on the taxon formula you use.

One way to deal with that, which is what the other issue is getting at, is to allow 2 IDs in data entry/bulkloading. Alternatively, we could do the initial ID in data entry/bulkloading and then go add the second ID/formula once the data are loaded, but that is an extra step and I think it's better to do it at the time you're doing the data entry (especially since there is often a lag between when data are entered by students and checked/loaded by staff).

For data entry/bulkloading, we need a way to add both IDs with the determiner, date, nature, and confidence level, PLUS indicate which is the accepted ID. Can we address the data entry issue along with this issue?

can only identify a specimen to genus, and use taxon A, then confidence = high

I think these are two very different things. "I'm sure it's that genus" and "I think it's probably that genus" are both common scenarios.

assigning 'ssp' suggests that we tried to ID to subspecies but couldn't with confidence so we didn't

That's not what I'd get from there! (I thought it just meant "there are subspecies for this species.") Again I think confounding any >1 concepts is going to lead to confusion. We now have a way to explicitly record confidence, and trying to derive it from anything else is just going to confuse most users.

In any case I could implement https://github.com/ArctosDB/arctos/issues/727 in ~a week (just needs prioritized, especially in relation to PG and the other "emergency" issues that have come up in the past few days), or we might find a way to implement https://github.com/ArctosDB/arctos/issues/2178 as part of SABI, which has the potential to allow any number of just about anything to be loaded ~simultaneously with the core record.

This tripped me up while loading specimen through the bulkloader (both from single entries in Data Entry and from uploaded .csv). I was using 'expert' and 'field' as nature of ID, but the error reads: b_bulkload: b_bulkload: : ORA-20001: Legacy nature_of_id terms are disallowed; see https://github.com/ArctosDB/arctos/issues/2170ORA-06512: at "UAM.TEMP_TR_ID_BIU", line 4ORA-04088: error during exe {snip...}
Can we remove these outdated nature_of_ID's from the dropdowns? Why are we allowed to select these options if they will just fail the validation?
OR: Can we edit the error message to point to the code table instead of this issue (there's a lot to read here)?

Also, there is no 'confidence' field on the Data Entry page.

remove these outdated nature_of_ID's from the dropdowns

Not while they're still used, and something I thought was fully resolved seems to have started over.

point to the code table

Sure - but I think it's already linked everywhere this can be used??

Data Entry

https://github.com/ArctosDB/arctos/issues/2170#issuecomment-537600903

Thanks (and sorry for the repetition, I was skimming too quickly through this issue).

:wave: @dustymc, finalize this

A sp. and A ssp. are the same concepts - you can identify to some level (genus or species but not to species or subspecies, respectively).

Sure, there can be different scenarios where confidence is high or medium or low at any level (confident of genus but not species, or species but not subspecies). That's my point.

Maybe this is (partly?) resolved by getting rid of 'sp' and 'ssp' (and 'aff' and 'cf' ?) and using A {string} for those IDs per issue 1304. If we do that, you'd select A for the genus or species and assign a confidence value, but then put 'sp' or 'ssp' in the string - no need to assign a confidence level to that. However, that still brings up an issue because the way the form is now, you're assigning confidence to the whole ID "A {string}" when really you want to assign confidence to A. So I'm not sure what confidence to assign for "A {string}" when it's 'sp' or 'ssp'

Let's talk about 727 at the next issues meeting. This is high priority for me, especially with the changes, because it's usually a two-step process for identifying specimens to subspecies which is typical for birds and mammals: field/features ID to species, then geographic distribution (plus features in more complicated cases) to subspecies. It would be nice to be able to create bth IDs when doing data entry, each with their own confidence.

FINALLY - regarding migration: when you do the final migration, can you also update the values for nature_of_ID that are in bulkloader files so those users don't have to worry about changing those manually!?

final migration

Unless something drastic happens, I'll probably aim for Friday - Arctos has been really weird for me lately, hopefully that'll help mitigate any meltdowns (or at least give ya'll an excuse to quit early!). Yes I can get the bulkloader too - or I can do that now?

I would just do the bulkloader at the same time.

Arctos has been very SLOW lately, not sure if that's what you're talking about though.

Thanks.

OK I'll plan on that.

Yes that's what I'm talking about - neither I nor TACC can fully explain it, which is making me more paranoid than usual....

Yes, horribly slow today! I was trying to finish a loan to get to Fedex
before the 7pm deadline, and could not get barcodes to load to parts.

On Wed, Oct 16, 2019 at 5:09 PM dustymc notifications@github.com wrote:

OK I'll plan on that.

Yes that's what I'm talking about - neither I nor TACC can fully explain
it, which is making me more paranoid than usual....

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBCA4ULDQEPNNPY6YYTQO6NKVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBOGXIQ#issuecomment-542927778,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ADQ7JBD43EUCF635QTGCP3DQO6NKVANCNFSM4IEIGLOA
.

Back to 'features' - Is there a functional difference between 'coarse features' and 'features' ? I can see keeping 'fine features' as separate (looked under a microscope or whatever), but 'looks like a moose' by its features seems the same as by its coarse features. What do folks think about reducing this to just 'features' and 'fine features' to make the data more consistent in how these are used?

Here's who's using %features%

guid_prefix, nature_of_id,count(*) c 
from collection,cataloged_item,identification where 
nature_of_id like '%features' and 
collection.collection_id=cataloged_item.collection_id and 
cataloged_item.collection_object_id=identification.collection_object_id 
  7  group by guid_prefix,nature_of_id order by guid_prefix,nature_of_id;

GUID_PREFIX      NATURE_OF_ID          C
-------------------- -------------------- ----------
BYU:Herp         coarse features          80
BYU:Herp         features            742
BYU:Herp         fine features       121
CHAS:Teach       coarse features          62
CHAS:Teach       fine features         1
DMNS:Bird        features             16
DMNS:Mamm        coarse features           2
DMNS:Mamm        features              8
DMNS:Mamm        fine features         3
KWP:Ento         fine features       179
MSB:Herp         features              5
MSB:Mamm         coarse features         101
MSB:Mamm         features            261
MSB:Mamm         fine features         1
MVZ:Bird         coarse features           9
MVZ:Bird         features             43
MVZ:Herp         features              1
MVZ:Herp         fine features         4
MVZ:Mamm         features              1
UAM:EH           fine features         3
UAM:Ento         coarse features          98
UAM:Ento         fine features        72
UAMObs:Ento      fine features        10
UCM:Bird         coarse features          11
UCM:Bird         fine features         1
UCM:Herp         fine features         1
UCSC:Bird        features            554
UCSC:Herp        features              1
UMNH:Mamm        features              9
UMNH:Teach       features             64
UNR:Herp         features            268
UNR:Mamm         features              4
UWYMV:Fish       features             18

33 rows selected.

I thought we'd given up on the coarse and fine distinction and were just
going with features, which is what I've been using last couple of days.

On Thu, Oct 17, 2019 at 11:22 AM Carla Cicero notifications@github.com
wrote:

Back to 'features' - Is there a functional difference between 'coarse
features' and 'features' ? I can see keeping 'fine features' as separate
(looked under a microscope or whatever), but 'looks like a moose' by its
features seems the same as by its coarse features. What do folks think
about reducing this to just 'features' and 'fine features' to make the data
more consistent in how these are used?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ADQ7JBELMFMXBS22GFRTLNTQPCNOFA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBQ4CDA#issuecomment-543277324,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ADQ7JBG5DROUNCJK4TE2553QPCNOFANCNFSM4IEIGLOA
.

Is this for new data or per migration requests?

Mariel - That was discussed and what I favored, but I think others still wanted to have more than just 'features.' Like I said, 'fine features' makes some sense. I also am just using 'features' and still think it's worth asking the question about coarse features. What is the functional difference between that and just 'features'?

Dusty - Please change the 9 MVZ:Bird records to just 'features' - Not sure if those are new records, but if they are, then it emphasizes my point about inconsistency in how the data are entered.

Dusty - You're still planning on doing the changes tomorrow, right? Chris says that Jim P is anxious to get his records out of the bulkloader so he can curate specimens from his most recent trip. Thanks.

I'm fine with just 2: 'features' and 'fine features' but the definition for
'features' should mention that this is used for cases when it was unknown
if the examination was of coarse or fine features.. not just an assumption
that the features were not fine.

fine,
D

On Thu, Oct 17, 2019 at 9:42 AM Carla Cicero notifications@github.com
wrote:

Dusty - You're still planning on doing the changes tomorrow, right? Chris
says that Jim P is anxious to get his records out of the bulkloader so he
can curate specimens from his most recent trip. Thanks.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM3325PFIXAKCOPLNXLQPCPX3A5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBQ56OA#issuecomment-543285048,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM6MAA45SXOTDOP4IKLQPCPX3ANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

change the 9 MVZ:Bird

done

changes tomorrow,

If I think I can without melting something important, yes.

out of the bulkloader

I can help with that at any time, or that should be straightforward with SQL option.

Thanks Derek. I agree re: definition.

Dusty - Can you go ahead and make the nature of ID change to the records under username 'patton' in the bulklloader? Then he can load them. Thanks.

patton

Done

We're getting error messages when we try to update an identification on an existing record. The messages refer us to this Issue but I don't see a resolution here.

This is our first update attempt.

ID error - input

This is the error message which directs us to this issue.

Error message

The handbook references #515.

We are not trying to use "legacy" as the initial ID, so is the error message a bug?

If we select "revised taxonomy" for the new ID, we sometimes get this error message. We get whether or not we have completed the "confidence" field.

revised taxonomy

Right now, we seem locked out of updating identifications.

P.S. We can get it to work with some other selections such as "audio-visual" then change it to student or revised taxonomy before we save it.

Thanks D.

Phyllis - 'student' is a legacy value that's no longer allowed. As soon as Dusty makes the changes (hopefully tomorrow), he'll clean up the list so it only has the new values. Sorry for the inconvenience in the interim.

Thanks, Carla. I can't find in this thread what "student" will be changed to (like "legacy" was changed to "unknown"). Is the new value in the list? Or is it just a confidence level? Sorry I didn't keep up with this thread.

OK, so "student" will become "features" at medium confidence. Thanks.

For the terminology:

audio-visual

might be better as

audio-video

which would unambiguously match its definition. Currently I could see
someone unfamiliar with the definitions using this as a synonym of
features. Changing visual to video would prevent that.

And I think we agreed to eliminate 'coarse features' and just use 'features'

And I will reiterate that 'type specimen' should not become legacy - there
are lots of such IDs that lack citations and eliminating this option would
toss valuable data.

-Derek

On Thu, Oct 17, 2019 at 11:36 AM dustymc notifications@github.com wrote:

See
http://arctos.database.museum/info/ctDocumentation.cfm?table=CTNATURE_OF_ID
for terminology and
https://docs.google.com/spreadsheets/d/14IOPiv2vHbZv30N3975Y80wf2-_1__Xy1I7ZfXI1FSM/edit#gid=1160354566
for the "unless someone tells me otherwise..." mapping.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMZ5K72Q7YDQJSEKBKLQPC5FPA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBRIVZA#issuecomment-543328996,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM4WWH6UURQFXRDONY3QPC5FPANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

eliminate 'coarse features'

I'd just need a map for those that have used it - https://github.com/ArctosDB/arctos/issues/2170#issuecomment-543281386

type specimen

PLEASE, suggest a better mapping! https://docs.google.com/spreadsheets/d/14IOPiv2vHbZv30N3975Y80wf2-_1__Xy1I7ZfXI1FSM/edit#gid=1160354566 (And remarks will catch anything that gets tossed.)

The thing about 'type specimen' that is special and worth keeping (no need
to map if it's not changing... or just map to itself) is that when a
taxonomist describes a taxon and designates a type specimen they've often
done a hell of a lot more than what is encompassed in id method =
'features'.

They've compared those specimens to numerous others, assessed variation
across space / populations / close relatives etc. In some cases, they've
used molecular methods to assess genetic uniqueness as well as 'features'
and in other cases, rarely, they might have actually used ecological/
behavioral data to assess species status. They might have even done
breeding trials.

Regardless of how many different methods they've used, even if they
described the new species based only on 'features' there's a huge
difference between using a key someone else wrote to key out an unknown
using 'features' and a taxonomist describing a new species, and WRITING the
key and describing the species.

Additionally, because of the rules of nomenclature, type specimens ALWAYS
belong to the name - even if that name has later been deemed to not
represent a distinct species the type specimen is tied to that name by the
"laws" of nomenclature.

All this cannot be summarized as 'features' and thus I argue that 'type
specimen' should not become legacy but should remain in our list of nature
of ID options.

-Derek

On Thu, Oct 17, 2019 at 12:18 PM dustymc notifications@github.com wrote:

eliminate 'coarse features'

I'd just need a map for those that have used it - #2170 (comment)
https://github.com/ArctosDB/arctos/issues/2170#issuecomment-543281386

type specimen

PLEASE, suggest a better mapping!
https://docs.google.com/spreadsheets/d/14IOPiv2vHbZv30N3975Y80wf2-_1__Xy1I7ZfXI1FSM/edit#gid=1160354566
(And remarks will catch anything that gets tossed.)

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM6UJRRHLAWIZ3WRH4DQPDCAHA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBRMN4Y#issuecomment-543344371,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM45XHHHWPS5K4ZJIDTQPDCAHANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

I like audio-video

I also agree that 'type specimen' should remain an option in the list.

In some cases, they've
used molecular methods to assess genetic uniqueness as well as 'features'
and in other cases, rarely, they might have actually used ecological/
behavioral data to assess species status. They might have even done
breeding trials.

That is precisely the information I think we need to capture; if you're a whateverologist, you might want to find types defined by somethingelseology (eg, because you want to see if morphology and COI agree, or .....). Tossing all of that into "type specimen" won't support any of those questions.

A search on typestatus like %type (or something more specific) + nature_of_id=features WILL find those specimens.

If there's some value in these data, I suggest we follow MCZ's lead and create a "random types with no more information" publication to make the correct links (eg, make them discoverable along with well-documented types), even if we do it with a low-quality publication. (That probably needs a new "unknown type" or similar status??)

I checked a half-dozen of these, one of them looks like it's probably really an undocumented paratype, the rest I'm fairly certain just have random values here.

Here's a summary:

col guid_prefix format a30;
select guid_prefix, count(*) c from collection,cataloged_item,identification where 
collection.collection_id=cataloged_item.collection_id and
cataloged_item.collection_object_id=identification.collection_object_id
and identification.nature_of_id='type specimen'
and not exists (select collection_object_id from citation where citation.collection_object_id=cataloged_item.collection_object_id and type_status like '%type')
group by guid_prefix order by guid_prefix;


GUID_PREFIX             C
------------------------------ ----------
ALMNH:ES                3
DGR:Bird                6
DGR:Mamm                1
DMNS:Bird               3
DMNS:Mamm                  17
HWML:Para                1710
KNWR:Ento               9
MLZ:Bird                2
MSB:Bird                   10
MSB:Fish                8
MSB:Host                1
MSB:Mamm                 4436
MSB:Para                   44
MVZ:Bird                  130
MVZ:Herp                  518
MVZ:Mamm                 1109
MVZObs:Herp             1
UAM:Bird                  156
UAM:Ento                3
UAM:Fish                2
UAM:Herb                  218
UAM:Herp                2
UAM:Inv                 9
UAM:Mamm                 6699
UAMObs:Ento                53
UAMObs:Mamm             7
UCM:Fish                   21
UCM:Herp                  442
UCM:Mamm                   26
USNPC:Para              1
UTEP:ES                    12
UTEP:Ento                  20
UTEP:Herb                  15
UTEP:Herp                 300
UTEP:HerpOS             1
UTEP:Inv                   20
UTEP:Zoo                   23
UWBM:Herp               2


and data

create table temp_undoctypes as select guid from flat,identification where flat.collection_object_id=identification.collection_object_id
and identification.nature_of_id='type specimen'
and not exists (select collection_object_id from citation where citation.collection_object_id=flat.collection_object_id and type_status like '%type')
order by guid;

temp_undoctypes.csv.zip

It should not be up to the data entry technicians to assess all the methods
that were used by an author who describes a species. All they know is that
it's a type specimen and thus it's important to flag it as such EVEN if the
citation / publication is unknown.

-Derek

On Thu, Oct 17, 2019 at 1:00 PM dustymc notifications@github.com wrote:

In some cases, they've
used molecular methods to assess genetic uniqueness as well as 'features'
and in other cases, rarely, they might have actually used ecological/
behavioral data to assess species status. They might have even done
breeding trials.

That is precisely the information I think we need to capture; if you're a
whateverologist, you might want to find types defined by somethingelseology
(eg, because you want to see if morphology and COI agree, or .....).
Tossing all of that into "type specimen" won't support any of those
questions.

A search on typestatus like %type (or something more specific) +
nature_of_id=features WILL find those specimens.

If there's some value in these data, I suggest we follow MCZ's lead and
create a "random types with no more information" publication to make the
correct links (eg, make them discoverable along with well-documented
types), even if we do it with a low-quality publication. (That probably
needs a new "unknown type" or similar status??)

I checked a half-dozen of these, one of them looks like it's probably
really an undocumented paratype, the rest I'm fairly certain just have
random values here.

Here's a summary:

col guid_prefix format a30;
select guid_prefix, count(*) c from collection,cataloged_item,identification where
collection.collection_id=cataloged_item.collection_id and
cataloged_item.collection_object_id=identification.collection_object_id
and identification.nature_of_id='type specimen'
and not exists (select collection_object_id from citation where citation.collection_object_id=cataloged_item.collection_object_id and type_status like '%type')
group by guid_prefix order by guid_prefix;

GUID_PREFIX C


ALMNH:ES 3
DGR:Bird 6
DGR:Mamm 1
DMNS:Bird 3
DMNS:Mamm 17
HWML:Para 1710
KNWR:Ento 9
MLZ:Bird 2
MSB:Bird 10
MSB:Fish 8
MSB:Host 1
MSB:Mamm 4436
MSB:Para 44
MVZ:Bird 130
MVZ:Herp 518
MVZ:Mamm 1109
MVZObs:Herp 1
UAM:Bird 156
UAM:Ento 3
UAM:Fish 2
UAM:Herb 218
UAM:Herp 2
UAM:Inv 9
UAM:Mamm 6699
UAMObs:Ento 53
UAMObs:Mamm 7
UCM:Fish 21
UCM:Herp 442
UCM:Mamm 26
USNPC:Para 1
UTEP:ES 12
UTEP:Ento 20
UTEP:Herb 15
UTEP:Herp 300
UTEP:HerpOS 1
UTEP:Inv 20
UTEP:Zoo 23
UWBM:Herp 2

and data

create table temp_undoctypes as select guid from flat,identification where flat.collection_object_id=identification.collection_object_id
and identification.nature_of_id='type specimen'
and not exists (select collection_object_id from citation where citation.collection_object_id=flat.collection_object_id and type_status like '%type')
order by guid;

temp_undoctypes.csv.zip
https://github.com/ArctosDB/arctos/files/3741268/temp_undoctypes.csv.zip

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM4JR6XYZK3N7KEY6T3QPDHARA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBRQIHI#issuecomment-543360029,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM7FDJV5KRVR3LHQXYTQPDHARANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

important to flag it

How important? What's there (apparently-incorrect data aside) requires some knowledge of an arbitrary administrative process, at least two queries to find all the types, and displaces methodology from "nature." What I laid out works like everything else, doesn't require your techs to assess anything ('unknown' is always an option), doesn't need a specific publication, and there's a place for method-based 'nature' if it is known.

In any case I can drop this out of the migration if that's what's best.

I'd argue that 'type specimen' is a method. It's a method that employs the
rules of nomenclature to assert this specimen is this taxon.

Why would it require 2 queries to find all the types- because some won't
have nature of id = type specimen? I can see that being a problem but it's
easily fixed by setting that field to = type specimen for all type
specimens. However, it'd be hard to enforce consistency though, people
could use the citation / holotype option and for the nature of ID choose
something besides 'type specimen'

You say that what you laid out "doesn't need a specific publication" - how
then can one specify a specimen is a type without creating a citation from
a publication?

Another option, and what I've done in databases I've made and used in the
past, is to have a field dedicated to 'type status' and most specimens
would be 'none' but all the type specimens would have something in that
field like 'holotype' etc.

Is there a Darwin core field for that?

-D

On Thu, Oct 17, 2019 at 4:36 PM dustymc notifications@github.com wrote:

important to flag it

How important? What's there (apparently-incorrect data aside) requires
some knowledge of an arbitrary administrative process, at least two queries
to find all the types, and displaces methodology from "nature." What I laid
out works like everything else, doesn't require your techs to assess
anything ('unknown' is always an option), doesn't need a specific
publication, and there's a place for method-based 'nature' if it is known.

In any case I can drop this out of the migration if that's what's best.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUMYZAD7ECWH7FFRAQKTQPEAKDA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBSAQOA#issuecomment-543426616,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM7RWRY6TPWEOPYYS5DQPEAKDANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

argue that 'type specimen' is a method

I'm not sure that's WRONG, but there's also a lot of noise from folks who might want to find the resulting types when someone magicks a species out of DNA or photos or other nontraditional methods. As far as I know similar data have never been published - who knows, maybe nobody will use it, they definitely won't if we structure our data such that the information isn't accessible.

2 queries

"Real" types in Arctos require publications; that's the way most users will find them, and it's what I use for DWC and reporting and etc. (Yes DWC has a field.)

setting that field to = type specimen for all type specimens

I suppose that's a curatorial call, but it's definitely not something I'd recommend.

how then can one specify a specimen is a type without creating a citation from
a publication?

You can't, but it's easy to materialize publications. https://mczbase.mcz.harvard.edu/SpecimenUsage.cfm?action=search&publication_id=35944 exists to allow "normal" access to types without forcing you to actually track down the real publication. (I think they've got it a little too fine-grained, but the idea seems to work well enough - and the fake publications aren't holding thousands of records that REALLY don't look like types, which is nice!)

field dedicated to 'type status'

Yea, that's the norm. I don't think I've ever got much information out of one. I suppose we could spin up an attribute for that but then users would find it, use it to locate the few percentage of alleged types that aren't supported by publications, and leave thinking they've found what we have.

OK, I'm convinced.

Best to do things only 1 way.

-Derek

On Thu, Oct 17, 2019 at 6:34 PM dustymc notifications@github.com wrote:

argue that 'type specimen' is a method

I'm not sure that's WRONG, but there's also a lot of noise from folks who
might want to find the resulting types when someone magicks a species out
of DNA or photos or other nontraditional methods. As far as I know similar
data have never been published - who knows, maybe nobody will use it, they
definitely won't if we structure our data such that the information isn't
accessible.

2 queries

"Real" types in Arctos require publications; that's the way most users
will find them, and it's what I use for DWC and reporting and etc. (Yes DWC
has a field.)

setting that field to = type specimen for all type specimens

I suppose that's a curatorial call, but it's definitely not something I'd
recommend.

how then can one specify a specimen is a type without creating a citation
from
a publication?

You can't, but it's easy to materialize publications.
https://mczbase.mcz.harvard.edu/SpecimenUsage.cfm?action=search&publication_id=35944
exists to allow "normal" access to types without forcing you to actually
track down the real publication. (I think they've got it a little too
fine-grained, but the idea seems to work well enough - and the fake
publications aren't holding thousands of records that REALLY don't look
like types, which is nice!)

field dedicated to 'type status'

Yea, that's the norm. I don't think I've ever got much information out of
one. I suppose we could spin up an attribute for that but then users would
find it, use it to locate the few percentage of alleged types that aren't
supported by publications, and leave thinking they've found what we have.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM565ZBCLXH475DNAS3QPEOEPA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBSIM6Y#issuecomment-543458939,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUM557RLEH7QKNI37VPDQPEOEPANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Sweet!

Lacking better ideas materializing in the very near future, I will

  • create a new type status "unknown type" (so it'll show up in the "any TYPE" search) definition "DO NOT USE! Placeholder for records which were flagged with identification.nature_of_id type specimen, and which cannot currently be resolved to more specific Types and publications."
  • create a 'placeholder for type specimens' publication
  • create a citation using those data for all NoID=type specimen records which do not have an existing %type citation
  • file a 'please clean this up' issue (I think many/most of these are noise, not actual types)

Reasonable?

I'd like to revise the above.

There are ~16K specimens with a NoID of "type specimen" which do not have a corresponding 'type' type_status, but >10K of them do have a publication (usually citing 'voucher') on the 'type specimen' identification. I think those are clearly just using 'type specimen' (as NoID) in a slightly different way, NOT attempting to use NoID as a placeholder for type specimens. Unless someone has a compelling reason to do otherwise, I will ignore those and only create citations (using the new fake publication) for the ~5K citations which use NoID "type specimen" without a corresponding publication.

Out of curiosity, are there records that have NoID of 'type specimen' and have a type_status other than 'voucher' - i.e., ones that are actual types where it is a placeholder for type specimens?

Can you send me a csv of the 130 MVZ:Bird records that have NoID of 'type specimen' so I can look at those? Thanks.

are there records that have NoID of 'type specimen' and have a type_status other than 'voucher'

col type_status format a20;
select  citation.type_status, count(*)  from 
 flat,identification,citation
 where
 flat.collection_object_id=identification.collection_object_id and
 identification.identification_id=citation.identification_id and
identification.nature_of_id='type specimen' and
citation.type_status!='voucher'
group by citation.type_status
;


TYPE_STATUS        COUNT(*)
-------------------- ----------
holotype            929
isosyntype            2
basis of illustratio        108
n

paratype           2725
erroneous citation       53
isotype              50
isolectotype              4
lectotype            45
symbiotype           78
neotype               3
referral            197
isoneotype            1
paratopotype           1037
syntype              64
host voucher            423
paralectotype            30

placeholder

Those are ACTUAL citations; I'm using the word "placeholder" for things that don't have citations (yet - they'll get them, but against a fake publication) but have some indication that they might be types. (So now the data essentially say "this is a type, here's the publication" and "this is a type, we just say so." This change will provide the possibility to turn the latter into "this is a type, it's based on features, it's a really great candidate for a molecular study!")

csv

It's attached to https://github.com/ArctosDB/arctos/issues/2170#issuecomment-543360029

All catalog records with a 'type specimen' NoID are now attached to a publication; those that weren't already attached to something are on http://arctos.database.museum/publication/10008933

Thanks!

We need the new 'confidence' field in the data entry screen.

-Derek

On Fri, Oct 18, 2019 at 9:47 AM Carla Cicero notifications@github.com
wrote:

Thanks!

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ACFNUM6B5QBGIEQZZRKETP3QPHZBVA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBVJ3OI#issuecomment-543858105,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ACFNUMZGPAMKXFIT6KNH3DDQPHZBVANCNFSM4IEIGLOA
.

--

+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960

[email protected]

phone: 907-474-6278
FAX: 907-474-5469

University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
http://www.uaf.edu/museum/collections/ento/
+++++++++++++++++++++++++++++++++++

Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us http://www.akentsoc.org/contact.php

Yes I’d like to see the mvz herps like that too

On Fri, Oct 18, 2019 at 9:19 AM Carla Cicero notifications@github.com
wrote:

Out of curiosity, are there records that have NoID of 'type specimen' and
have a type_status other than 'voucher' - i.e., ones that are actual types
where it is a placeholder for type specimens?

Can you send me a csv of the 130 MVZ:Bird records that have NoID of 'type
specimen' so I can look at those? Thanks.

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
https://github.com/ArctosDB/arctos/issues/2170?email_source=notifications&email_token=ABCJF4NHXJ2EPFPMUQCXS33QPHOZHA5CNFSM4IEIGLOKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBVACVA#issuecomment-543818068,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/ABCJF4NZIGU6YUDJ6W2G3PLQPHOZHANCNFSM4IEIGLOA
.

>

Sent from Gmail Mobile

I don't know what went on here but you have turned 1 simple and understandable field into a mess. I would like to load records identified by students, I have entered 100s of records with the nature of ID "student" but now I assume it should be "course features" with a confidence level that is pulled out of my a**? Where is the confidence level field in the bulk loader menu? I need to change "student" to "course features" but there is no way to add a confidence level from the bulkload SQL page and it doesn't appear at all in the AJAX page.

Updates including bulkloader are done and the code table is cleaned up. It will take a couple days for everything to find its way to all forms. Closing this monstrosity; I opened a couple new issues for everything, I hope, that still needs handled.

confidence level that is pulled out of my a**?

I'd say that's NULL (='we have nothing informative to say, so we'll say nothing'), and at least earlier in this thread I was under the strong impression that that would be the case most of the time. Please prioritize https://github.com/ArctosDB/arctos/issues/2323 if there is an immediate need to assert confidence in data entry/bulkloaders.

@dustymc
Please edit the Edit Citation/ Create Citation And Identification tool so that the default Nature of ID is NULL. Currently it is set to audio-visual. All dropdowns should always have a null default.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

Jegelewicz picture Jegelewicz  Ā·  6Comments

sharpphyl picture sharpphyl  Ā·  7Comments

acdoll picture acdoll  Ā·  8Comments

acdoll picture acdoll  Ā·  4Comments

dustymc picture dustymc  Ā·  3Comments