Lawyering-up Research integrity

Spiderman’s lawyer is having you for dinner tonight

Spider researcher Jonathan Pruitt is accused by his coauthors of data manipulations, after 3 retractions they demand more. A lawyer's letter was supposed to stop that, but Pruitt tells me: "I'm happy for folks to engage in public discourse about my data integrity""

The Canadian shooting star of behavioural ecology Jonathan Pruitt used to be everyone’s best friend. Up until his co-authors accused the spider researcher of data manipulation and achieved retractions for 3 of his papers. His reputation in tatters, his research record facing even more retractions, his elite funding as one of Canada’s “150 Research Chairs” and prestigious job at McMaster University in danger, Pruitt did what every red-blooded academic in his position does: he deployed lawyers against his critics.

As Science reported, the lawyer’s letter was so scary that some journal editors panicked and hid under their tables, waving a white flag of surrender. No more Pruitt retractions unless specifically requested by the ongoing investigation at McMaster, no matter what the evidence submitted by co-authors proves.

The original recipients however, all of them former Pruitt co-authors, are not complying, they continue vetting Pruitt’s publications for further irregularities. As I learned, nobody signed anything or announced to adhere to any of lawyer’s demands. And why should they be afraid, the lawyer’s letter is not even really threatening anyone. It is a cack-handed attempt in intimidation which even Pruitt now apparently distances himself from.

All the letter from Millard and Company does, is to educate its recipient that nobody is supposed to be scrutinising Pruitt’s papers but his employer McMaster University, and that every retraction must be agreed by the Committee on Publication Ethics (COPE, a scholarly publisher lobbyist), whose advisory guidelines to its member journals are presented by Pruitt’s lawyers as a kind of a law for breaking of which you might end up in prison.

The letter is labelled “PRIVATE & CONFIDENTIAL WITHOUT PREJUDICE”, which is exactly why I publish it here in full, below. I hope the Millard lawyers are happy with this arrangement.

The Pruitt case made international news, it even got its own hashtag n Twitter: #PruittData. The communal effort to uncover data irregularities began with Kate Laskowski, now assistant professor at UC Davis in USA. Laskowski told the story on her blog, and it might explain the success recipe of Pruitt: when he met some scientist with an unproven pet theory, he provided them with the perfect experimental data. Which now proved to have been too good to be true.

In science, many beautifully designed theories collapse on experimental reality. Biology is very messy, complicated and rarely does as told, even and especially when the theory which biology is expected to adhere to is clever, beautiful and elegant, of the kind the editors in elite journals love and the research funders cherish. Which is exactly why so many scientists resort to research fraud, to force scientific reality to adhere to their theories.

Laskowski, then a PhD student, had a theory about animal behaviour in social context. She met at a conference the young shooting star Pruitt, who works with social spiders. Pruitt generously offered his collaboration, and soon perfect data arrived which perfectly proved Laskowski’s theory right. Pruitt’s datasets served as template for 3 papers in respectable journals, until someone contacted Laskowski with concerns. That colleague was Niels Dingemanse, professor at the LMU Munich in Germany, and he noticed there were statistically too many duplicate numbers in the paper’s publicly available original data file.

Pruitt told Laskowski he measured 4 spiders simultaneously, hence identical values. But how can spider act so synchronously and into 1/100 of a second?

Thus alerted, Laskowski started to go through Pruitt’s Excel files, and indeed found many repetitive numbers. Pruitt was quick with an explanation, which sounded a bit weird, but it kind of sufficed to consider a correction. But then Laskowski analysed the files in more details and found entire strings of numbers repeating themselves, with the only logical explanation that they were made up. Once Laskowski removed all repetitive numbers, the remaining ones showed no significant difference in spider behaviour between experimental treatments. Her theory collapsed, so Laskowski had her own three papers retracted, one by one.

Laskowski: “Each line is an individual spider. What I noticed is that the last two pre-treatment values, seemed to be identical to the first two post-treatment values in many spiders in this treatment. I colored those values red here. I also noticed that many values were very similar, but off by one digit in the hundreds column, e.g. 77.65 became 477.56. Those values I colored in yellow.”

In another blogpost on the topic, Dan Bolnick, editor of American Naturalist and professor at University of Connecticut, explains that the journals actually very much can be expected to take responsibility and to perform data integrity investigations, which his journal did. Bolnick even used to curate a publicly viewable (but not editable) Google Docs file to track the reliability of Pruitt co-authored papers. Some papers actually proved to contain no evidence of irregularities. The list was removed now, after Pruitt lawyered-up. Bolnick’s commendable behaviour stands in stark contrast to certain other editors who now hide behind Pruitt’s lawyer’s letter and the McMaster investigation.

Laskowski: “as opposed to being formatted in columns as on Sheet 1, these values were transposed – so that the columns became rows. […] Here on Sheet 2, the duplicate sequences were not just sequences of 4 numbers (as they were on Sheet 1), but rather I could now see that in fact there were whole blocks of duplicated sequences.”

Unlike his lawyer claims, Pruitt even agreed to the first retraction, which happened in American Naturalist. Laskowski quotes him with: “it is well that the paper is being retracted”. But eventually Pruitt changed his mind. Things got out of his control namely. After that retraction, other Pruitt coauthors reached out to Laskowski, the search for data manipulations became a communal effort. Many went on to publish their concerns on PubPeer, where other Pruitt’s coauthors first defended Pruitt, until some switched sides and joined the investigative group effort, like the associate professor Noa Pinter-Wollman at UC Los Angeles. Did these scientists also discover their own beautiful theories, once verified by Pruitt’s data, now in lying in tatters? Possibly.

One former Pruitt collaborator explained why his co-authors initially defended the accused:

It’s worth remember that lots of us considered ourselves to be good to very close friends with him. You just don’t want to believe something like this.

And then they looked into their own papers with Pruitt. A co-author shared this about the raw data behind a publication:

literally 1/2 the data were sequence repeats 10-14 units long but sometimes with select manipulation

That source also described the communal #PruittData investigation:

There are 28 channels, on the slack channel where people are looking at this, each of which is dedicated to a paper. […] The sequence repeat pops up a ton. Formulas are pretty common. One person found that a formula was used but it was like + 20.3 or something to better hide the pattern.

Rows K-O in the Pruitt raw data file “Black_Widow_Data” list independent experimental values for “Darkness”. And yet, values in row O are simply values in M +1.5. The author sloppily even left inside the formula with which the values were fabricated.

Pruitt’s manipulative approach over his former friends does not work anymore. If anything, people are afraid of his revenge now, rightly or not, time will show. As a source wrote to me:

Jonathan has a typical response when confronted.  He restates the issue, gives excuses that don’t directly address the question, often will say we should remove the questionable data and reanalyze, and then finally will agree to retract if the results don’t hold. This was the pattern until lawyers got involved that is.  

[…] When I was early in my career I was also just enamored with him and trusted him.  So when he comes dangling a dataset that would turn into a high level paper, you gladly take it and write it up.  Later it seemed like what would happen is that he’s send off just chunks of datasets to different people so they don’t see the whole thing.  Or he would talk to say a postdoc about an idea, the six months later the data would appear in that postdocs inbox….  so they don’t ever see the stuff being collected, but are thrilled to have the data and to get a famous person to work with.” 

Pruitt raw data file “Spider_StateBasedPersonality”, sheet “Anelosimus.2” Colour coded are repetitive number sequences

Much of #PruittData problems came out because Pruitt used to share his research data openly. Until he got caught that is:

Dryad is the data repository for open source stuff. Many journals have started requiring us to upload our data. Jonathan would do that sometimes, but not others. Or he would upload only a portion of the data. This started when one of his own grad students acted as a whistleblower and told another lab about some issues. That lab looked into a bunch of his data on Dryad and found all these issues.

But now, the lawyer’s letter. Which even Pruitt himself admits is not suitable to scare anyone. He namely wrote to me via Twitter direct message, obviously with reference to the Science article:

I realize the report made it sound like I legally carpet bombed people; that’s not the case. I know folks have interpreted the action as a flimsy fear-induced gag order. That’s not the case either. I’m happy for folks to engage in public discourse about my data integrity


PRIVATE & CONFIDENTIAL
WITHOUT PREJUDICE

[….]

Please be advised that our firm has been retained by Jonathan Pruitt with respect to the investigation (if any) being conducted by […] into the paper “[…]” by […] in 20[…]. I understand you have been contacted by Dr […] one of the coauthors, seeking a retraction.

Over the last number of weeks, there has been an online campaign directed at Dr. Pruitt and his work which has come to be known as “#pruittdata” and “#pruittgate”. As part of #pruittgate, a number of academics have organized an extra-institutional online forum for posting, reviewing, and criticizing the data integrity of various of Dr. Pruitt’s research studies and papers. It appears that Dr. […]’s concerns arise from that project.

Dr. Pruitt disagrees that there are any grounds for this paper’s retraction. Any decision about retraction must be conducted fairly and in keeping with professional norms as articulated by the Committee on Publication Ethics (“COPE”) Guidelines on Retractions. It does not appear that this paper meets any of the criteria for retraction as set out in the COPE Guidelines. In particular, the COPE Guidelines do not support retraction if “the main findings of the work are still reliable” or if “an editor has inconclusive evidence to support retraction, or is awaiting additional information such as from an institutional investigation.”

In addition, I am concerned that the procedural fairness of any investigation has been hampered by the manner in which Dr. Pruitt’s work is currently being criticized online. The social media campaign and online forums which spurred this complaint break with the normative practices of the field and very likely prevent a fair assessment of Dr. Pruitt’s work.

The controversy over this particular paper highlights the problems with using an unmediated online forum to attack the data integrity of a body of work. In this case, it appears that the wrong electronic file was uploaded to the forum. The final published paper does not rely on the impugned data.

If […] there is sufficient grounds to retract this paper, it must first notify Dr. Pruitt and provide him with a chance to respond. A premature retraction without Dr. Pruitt’s agreement is likely to further harm his reputation and negatively impact his career in an irreversible manner.

Please confirm that […] does not intend to take further steps at this time
with respect to the above noted paper. Otherwise, if the journal intends to take any further steps, please provide me with correspondence (1) detailing the journal’s concerns, including what evidence it has showing that the paper’s findings are no longer reliable, and (2) outlining the process the journal intends to follow prior to retraction, including when and how Dr. Pruitt will be permitted to respond to any allegations against him.
I am happy to discuss this matter further if you have any questions or concerns. If you would like to speak to me, please arrange a time through my assistant Lori Leblond (lori@millardco.ca, 1-416-920-3123).
Yours truly,

Marcus McCann

Update 19.05.2020. Seems Pruitt’s lawyer was successful. The Pruitt et al Proc Roy Soc B: Biol Sci 2016 paper, previously retracted on 26 February 2020, was stealthily de-retracted by the publisher The Royal Society in March. Instead, an Expression of Concern was published on 29. April 2020, announcing “An investigation into these aspects is under way“. This is the deleted retraction:


Donate!

If you are interested to support my work, you can leave here a small tip of $5. Or several of small tips, just increase the amount as you like (2x=€10; 5x=€25). Your generous patronage of my journalism will be most appreciated!

€5.00

45 comments on “Spiderman’s lawyer is having you for dinner tonight

  1. Somebody should inform spiderboy’s lawyers:

    Like

  2. Pingback: Here’s the letter Jonathan Pruitt’s lawyers have been sending to journals and his former collaborators #pruittgate #pruittdata | Dynamic Ecology

  3. Though Laskowski is completely innocent, it is not unreasonable to speculate that she got a faculty job from his questionable data, from which she got three published papers from, and she may have pivoted off that for more work. This gave her an advantage over other candidates that did not have her great publication record. I dont, however, seeing her rescind her faculty position! Why didn’t she collect her own data? Most of my ideas are wrong from the data that I collect myself, and I kind of wish the same thing happened to her to equal the playing field. But then again, maybe if this had happened, a full fledged cheater who fudged data could have then gotten the position.

    Like

  4. Pruitt’s co-author Nicholas DiRienzo now commented on PubPeer
    Pruitt et al Behavioral Ecology and Sociobiology (2011)
    https://pubpeer.com/publications/2BE2F156C5AC7C0C33939A612FDD9C#1
    https://drive.google.com/uc?id=1grJs92VKblsNro0rKGRKNRETuRakZUz7
    Pruitt explained it as

    “Lingering formula are present in some versions of data files when there contained missing values. Excel-based stats and other older stats programs don’t always contend with them well. It’s possible I uploaded a version with one of those present. […] If they change the results in an important way that changes the interpretation, then ax the paper/s. If not, then consider an addendum. It’s important not to throw the baby out with the bath water.”

    DiRienzo concludes

    “Myself and the other coauthors (excluding Dr. Pruitt) deem the data associated with this paper as being untrustworthy. There is no reason that such a formula should ever exist in a dataset that is supposed to contain individual behavioral measures, and I have yet to receive a satisfactory explanation as to why it was in there. Additionally, the discrepancies between the values contained in the Small and Large datasets cannot be explained by any simple excel or data input error. We have requested a retraction from the journal and it is currently being reviewed by the editors.”

    Like

  5. Looks like Pruitt’s lawyer letter may have had a chilling effect after all:

    https://pubpeer.com/publications/F62202DB4DB0804CD2E65C25764549#8

    It seems reasonable to draw a connection between the lawyer letter and the apparent posting and removal of a retraction notice.

    Like

  6. Pingback: Interview with JBC research integrity manager Kaoru Sakabe – For Better Science

  7. Pingback: Mr ACE2 Josef Penninger, Greatest Scientist of Our Time – For Better Science

  8. Klaas van Dijk

    Copy/pasted from https://www.sciencedirect.com/science/article/pii/S0003347213002492 :

    “RETRACTED: Linking levels of personality: personalities of the ‘average’ and ‘most extreme’ group members predict colony-level personality, Jonathan N.Pruitt, Lena Grinsted, Virginia Settepani

    This article has been retracted at the request of the authors LG and VS. It has come to the authors’ attention that the data on spider boldness reported in this paper appear to contain irregularities in the form of an excess of duplicated values. These duplicated values even span across time points. For example, the boldness value 63.21 sec occurs in both boldness 1, 2 and 3 which were assayed at different times. Furthermore, extensive overlap has been found between this data and that from a different article published the same year: 74.3% of the boldness values are identical to those found in the other publication even though different spiders were assayed. On this basis, the authors LG and VS have lost faith in the reliability of the data and they wish to retract the paper. Author JNP does not agree to the retraction.”

    Like

  9. Klaas van Dijk

    Copy/pasted from https://royalsocietypublishing.org/doi/10.1098/rspb.2020.0255 :

    “Retraction: The Achilles’ heel hypothesis: misinformed keystone individuals impair collective learning and reduce group success

    N. Pinter-Wollman , C. M. Wright , C. N. Keiser , A. DeMarco and M. M. Grobis

    Published:29 July 2020

    The authors of the paper by Pruitt J.N., Wright C.M., Keiser C.N., DeMarco A., Grobis M.M., & Pinter-Wollman N. `The Achilles’ heel hypothesis: misinformed keystone individuals impair collective learning and reduce group success’ published in 2016 in Proceedings of the Royal Society B283: 20152888 were recently made aware of problems in the raw data collected by Pruitt J.N. In particular, 86% of the values of the rewarding stimulus in time 9 are separated by an integer (11) from the values in time 11 of the rewarding stimulus. Similarly, 82% of the values in time 10 of the rewarding stimulus are separated by an integer (3) from values in time 12 of the rewarding stimulus. In addition, 21% of the values in time 2 of the rewarding treatment are separated by a fixed integer (8) from values in time 1 of the dangerous treatment.

    Furthermore, in the data on individual learning, 80% of the values from time step 5 in the rewarding trial are separated by exactly 10 from the third time point in the unrewarding trials. Similarly, 80% of the values from time step 6 in the rewarding trial are separated by exactly 11 from the second time point in the unrewarding trials. These irregularities are focused on the keystone and generic individuals and no such irregularities are found in the control individuals.

    Reanalysis, with these irregularities removed, no longer provides clear support for some of the main conclusions of the study. The authors therefore wish to retract the paper.”

    Note that Jonathan Pruitt is not listed as author of this retraction note and note that this information is not listed in this retraction note. This retraction note also does not provide details about the whereabouts of the raw data which were removed because of “irregularities” / “problems”.

    Like

  10. Klaas van Dijk

    And another Retraction (published in July 2020). Copy/pasted from https://www.sciencedirect.com/science/article/pii/S0003347220301342 :

    “Retraction notice to “Individual differences in personality and behavioural plasticity facilitate division of labour in social spider colonies” [Animal Behaviour 97 (2014) 177–183]

    This article has been retracted at the request of the Editors and authors. After scrutinizing the raw data associated with this paper, the first and second authors discovered irregularities in the form of duplicated data sequences. In particular, identical behavioural profiles were repeated across individual spiders belonging to colonies that were purported to be independent. As these anomalies cannot now be explained, these authors consider the results to be unreliable. They have, therefore, retracted the paper. The data were collected by the third author, who agrees to the retraction.”

    Once again “irregularities” and “anomalies” in data which were collected by Jonathan Pruitt and which could not be explained (by Jonathan Pruitt?).

    Like

  11. Klaas van Dijk

    And an Expression of Concern of another paper in Animal Behaviour. Copy/pasted from https://www.sciencedirect.com/science/article/abs/pii/S0003347220301706 (published in August 2020):

    “Several concerns have been raised about the study’s integrity and data validity. This expression of concern is to inform readers about the potential issues related to this article. An investigation has revealed that the boldness measures, although not used in this study, contain some anomalous patterns of decimals in recorded latencies. These measures were supposed to be taken with a stopwatch, and thus one would expect the decimals that were recorded to the hundredth to be effectively pseudo-random. Yet, the centi decimals present are commonly repeated. Simulations show that these patterns of repeats should virtually never occur naturally, which raises questions regarding their validity and accuracy. Although this suspect measure is not used in the paper, it was collected as part of the aggression trials, of which other measures were used. Dr. Pruitt has not provided a sufficient explanation for this issue. Given this, Drs. DiRienzo and Hedrick would like to make the readers aware of these unexplained data anomalies. For a full description of the history of the datasets and the investigation done by Dr. DiRienzo, please see this post on PubPeer:https://pubpeer.com/publications/00E431B139DAF33455AC1FC2418F0F

    First author Nicholas DiRienzo wrote on 23 March 2020 at this Pubpeer thread:

    “I agree that the small dataset that is mainly integers makes it hard to detect anomalies (which is why I focused on the boldness data). I do find the three mass/boldness increases curious. I hadn’t found those, but I was trying to avoid sorting the data every to reduce the chances of finding false positives.

    It’s hard for me to give this portion of the dataset an endorsement beyond “I think they’re trustworthy.” Another way I could put it is “I couldn’t find any giant red flags, but I’m skeptical.” I made these posts as science demands trustworthy data, and thus even just even having to ‘think’ if they’re trustworthy should give readers pause.”

    First author Nicholas DiRienzo wrote on 5 August 2020 at https://twitter.com/Niku_DiRienzo/status/1290997804706328576 :

    “1 of 5 resolved. Not a retraction, but did get a Expression of Concern published highlighting the data anomalies. Please read the description below and check out the PubPeer post for more details.”

    “I think Animal Behaviour has been incredibly bold. There’s been a lot happening behind the scenes and they’ve powered through everything to do what’s best in each situation. Now, other journals… well, some have had their own best interests in mind.”

    “The other issue is that some are treating each paper as an isolated incident as they’re afraid of getting sued even though they should consider the whole of the situation.”

    “Some are also only concerned about the influence of the anomalies on the results. If they don’t hold with the anomalies removed then they retract, otherwise they just allow (force) a correction. This is why we’re seeing corrected papers with half the data removed.”

    Like

  12. Klaas van Dijk

    Copy pasted from “Bolnick, A. Sih, N. DiRienzo & N. Pinter-Wollman, 2020. Authorship removal correction for ‘Behavioural hypervolumes of spider communities predict community performance and disbandment’,
    https://doi.org/10.1098/rspb.2020.1852 (published: 19 August 2020)”:

    “The authors of this correction re-examined data generated by co-author J. N. Pruitt that contributed to the conclusions of the 2016 publication cited above, in a file not originally published with the paper. The file included waiting times (in seconds, ranging from 0 to 600) for individual spiders to initiate a behaviour. We found that certain integers (34, 45, 56, 67, 78 and 89) were substantially over-represented relative to adjacent integers, beyond what would be expected from reasonable statistical distributions. In the light of this over-representation, D. I. Bolnick, A. Sih, N. DiRienzo and N. Pinter-Wollman remove their authorship from this paper.”

    Copy/pasted from https://twitter.com/Niku_DiRienzo/status/1296070468491853824 (19 August 2020):

    “[1/3] 3rd #Pruittdata paper dealt with. In this case editors would not let us retract and instead wanted the corrections suggested by Dr. Pruitt. This includes rounding the values to the nearest increment of five and reanalyzing https://doi.org/10.1098/rspb.2016.1409

    [2/3] There were several other issues not described in the letter that killed my personal confidence in the data. Ultimately, given no good explanation was ever given for how the anomalies occurred, myself and coauthors were unwilling to reanalyze and submit a correction.

    [3/3] The only option was to remove ourselves from the paper (and even this proved difficult). As we’re no longer authors. Jonathan Pruitt will be allowed to reanalyze the data and submit a correction on his own.

    Deborah Leigh: will it go through another round of reviews though? And will the issues you have with the paper be disclosed to the reviewers? I think that is important! Sorry you couldn’t retract :/

    Nick DiRienzo: They’ve repeatedly told us to keep things confidential which only protects them and Pruitt, not the rest of us. I told them this paper would never pass peer review if these issues were known about at the start, so it should be retracted. That should be the standard but isn’t.

    Nick DiRienzo: [1/2] Also, I don’t want to pretend to be innocent. I should have been more concerned about the lack of some information (e.g. trial dates, trial order, etc.). I made assumptions without verifying and did my repeatability calculations off those.

    Nick DiRienzo: [2/2] Pruitt doesn’t have those data and doesn’t remember if they were ordered or unordered. The journal doesn’t seem concerned. This means we can’t account for order effects. They might not be there, but we won’t know.

    Nick DiRienzo: Here’s their report which they considered final. This is their summary after lots of back and forth. They only care if the anomalies influence the biological conclusions, not anything about how they came to be. https://drive.google.com/file/d/1oM5d0MHEzuAo39CdoJQvWqlaYnMPMYeQ/view

    Nick DiRienzo: (…) they’ve repeatedly asserted things are confidential (they’re not, btw). I don’t have a lot of hope for transparency moving forward.”

    So how to refer at the moment to https://doi.org/10.1098/rspb.2016.1409 ?

    Pruitt (2016, see also Bolnick et al. 2020)?

    Like

  13. Klaas van Dijk

    Copy/pasted from https://twitter.com/DanielBolnick/status/1296294691617615877 (20 August 2020):

    “Today some co-authors and I published a “correction” removing our names as authors on a paper https://doi.org/10.1098/rspb.2020.1852 This unusual move deserves explanation despite continuing moves by a coauthor’s lawyers that have had a chilling effect my expression of my scientific views

    So, I will limit myself to a few statements of fact: One of the datasets used to establish repeatability of behaviors (not originally posted on dryad) contained data on latency times for spiders to initiate a behavior (0 to 600 seconds)…

    As you might expect with latency times, the integer values were exponentially distributed. Except, a handful of integers were vastly over-represented relative to (1) exponential distribution expectations and (2) adjacent integers.

    I first noticed that the number 67 was >3x more common than 66 or 68. Odd, but not a clear signal of a problem. On looking for the other over-repesented integers, there was a pattern: each was a 2-digit number with adjacent values on a keyboad: 34, 45, 56, 67, 78, 89, 90.

    Each of these is ~3x more common than the flanking integers (e.g., 34 is more common than 33 or 35). I see no plausible biological cause for this pattern, and the probability of this pattern from exponential or other distributions is very low.

    The co-author who generated these data offered an explanation for this pattern, invoking unconscious observer bias of a few seconds here and there, and offered to redo the analyses rounding all values to the nearest 5 (e.g., 34 –> 35, etc) & provide a correction.

    The journal requested such a correction. To my limited knowledge the editorial board has not ruled out a retraction if the submitted correction proves to be insufficient.

    I would ask that the community not trash the ProcB editorial board: this is not yet the final word on this paper, and I expect they are under intense legal pressure to move slowly and cautiously.

    As co-authors, we also were very cautious in coming to our decision to remove ourselves. We did this based on lengthy (weeks-long) collective examination of the dataset specifically for this paper, and was not reached lightly or gladly.

    We make no claims about how the data came to be flawed, but the origins of the data is secondary to the clear biological implausibility of the numbers (which we documented with extensive statistical analysis), which suffices to lead us to end our authorship with this paper.

    I post this thread, speaking as a co-author, because I believe:
    1. the scientific record needs to be amended when flaws are found,
    2. these changes need to be disseminated to the paper’s audience,
    3. that social media is an effective tool for disseminating scientific changes.”

    Like

  14. Klaas van Dijk

    Copy/pasted from:

    Pinter-Wollman N. 2020. Authorship removal correction for ‘The legacy effects of keystone individuals on collective behaviour scale to how long they remain within a group’. Proceedings of the Royal Society B 287: 201842. http://dx.doi.org/10.1098/rspb.2020.1842 (published online 19 August 2020).

    “Due to recently discovered anomalies in the data collected and curated by J. N. Pruitt for the paper by J. N. Pruitt and N. Pinter-Wollman, ‘The legacy effects of keystone individuals on collective behaviour scale to how long they remain within a group’ published in 2015 in the Proceedings of the Royal Society B282, 20151766, which include formulas in the raw data file, duplication of portions of the data and an excessively high number of repeated values beyond what would be expected in an unbiased sample, N. Pinter-Wollman removes her authorship from this paper.”

    So how to refer at the moment to http://dx.doi.org/10.1098/rspb.2015.1766 ?

    Pruitt (2015, updated, see Pinter-Wollman 2020)?

    Pruitt (2020, see also Pinter-Wollman 2020)?

    Like

  15. Klaas van Dijk

    There are several ways how standardized observations on the social behaviour of spiders, in the field and/or in a lab, can be recorded.

    (1): write them by yourself in a note book / field diary / on a standardized form etc. This can be done when you are alone;
    (2): use an audio-recorder (various possibilities, including video, wildlife camera, etc.) to record your own spoken observations. This can be done when you are alone;
    (3): collect the data together with one or more others who write down your spoken observations in a note book / field diary / on a stadardized form etc.;
    (4): use a (modified / simplified) keybord (which it attached to a computer) to enter by yourself the (standardized) observations;
    (5): collect the data together with one or more others who use such a (modified / simplified) keybord to enter your spoken observations.
    etc.

    This list is incomplete. All methods produce raw research data, also called primary research data. I have huge piles of them (note books / field diaries / standardized forms). I only need a scanner / digital camera / mobile phone to make digital copies of these pieces of paper and send these copies to anyone who would like to have a copy (or post them online).

    A committee of Harvard University who investigated the serious allegations of research misconduct by Marc Hauser had for example access to a large amount of video/audio tapes taken during observations of monkeys, see for details e.g. https://www.harvardmagazine.com/2012/09/hauser-research-misconduct-reported

    I was wondering if any of the co-authors of the Pruitt papers which are at the moment under discussion has even seen (parts of) such kind of raw research data. Am I right that all these co-authors have until now only seen Excel-files etc. with a summary of these raw research data / primary research data?

    Like

    • As far as I can tell the Pruitt collaborators had a hypothesis, told Pruitt, and Pruitt collected all of the data for them to test the hypothesis, and e-mailed in a spreadsheet. Gee, I wish someone else would collect the data I needed for a paper and then all I would need to do is write it up and get it published, and really build up my CV real easy! No wonder Pruitt had so much influence, he made up data that others were to irresponsible to collect on their own, who could use the data to nicely pad up the CV’s for promotion. Nice scam going here…..

      Like

  16. Klaas van Dijk

    Copy/pasted from https://academic.oup.com/beheco/advance-article-abstract/doi/10.1093/beheco/araa073/5898590

    “Expression of Concern: Spider aggressiveness determines the bidirectional consequences of host–inquiline interactions, Behavioral Ecology, Published: 28 August 2020

    At the request of the Editor-in-Chief, the authors of the paper “Spider aggressiveness determines the bidirectional consequences of host–inquiline interactions,” by Keiser and Pruitt published in Behavioral Ecology in 2014 ( doi.org/10.1093/beheco/art096 ) recently investigated the raw data associated with this manuscript using visual inspection and the R package SequenceSniffer ( https://github.com/alrutten/sequenceSniffer ). Two irregularities were discovered: The presence of these anomalies renders this subset of behavioral data unreliable. The conclusions drawn in the original article are robust to reanalysis after excluding these data anomalies from the file available at Figshare ( https://figshare.com/articles/Data_for_Keiser_and_Pruitt_2014_-_Behavioral_Ecology/11778552 ). Nevertheless, because of the presence of these anomalies, readers are alerted to our concern over the integrity of the remaining data. We will publish a more complete correction once the remaining data have been verified.”

    Copy/pasted from https://twitter.com/HiDrNic/status/1299395150968311808 (co-author Nick Keiser):

    “Damn, this is not the reason I wanted to get back on twitter, but things need to be said. Our newest Expression of Concern is out in Behavioral Ecology, detailing problems we found with Keiser and Pruitt 2014 BehavEcol. [thread]

    A teaching tool in the lab where I did my PhD was that our advisor gave us a previously-collected dataset for our first research rotation which we use to learn statistics and scientific writing. I actually think this is an awesome practice (I’ve done it with 2 of my students).

    These data were collected by my advisor in 2010 in Tennessee and given to me, their first graduate student, to analyze and write up in Fall 2012. I loved this paper. It taught me how important individual differences in behavior could be for community ecology.

    This paper taught me what “inquilines” were (a type of social parasite) and I fell in love with social spiders and the diverse ecological communities they host in their colonies. This study became part of the foundation for how I viewed species interactions.

    This spring, we investigated the data with tools like SequenceSniffer (something I would have never known about eight years ago as a first-year PhD student). The errors we discovered are outlined in the EoC linked above. I was devastated.

    This EoC is the product of months of work, and countless reanalyses and emails back and forth. If you’re wondering how these types of discoveries result in Expressions of Concern vs. Retractions vs. Corrections, I don’t think I am at liberty to discuss. I wish we could.

    I’ll end with the final sentence of the EoC: “Nevertheless, because of the presence of these anomalies, readers are alerted to our concern over the integrity of the remaining data. We will publish a more complete correction once the remaining data have been verified.””

    Copy/pasted from https://twitter.com/dornhaus/status/1299411607362121728 (Anna Dornhaus):

    “The answer, if anyone wants to know, is lawyers.” (28 August 2020).

    Like

  17. Klaas van Dijk

    Copy/pasted from https://twitter.com/EvoSarah/status/1329215444272750592 (19 November 2020):

    “Unfortunately, I have to join the list of authors retracting a paper on spider behavior. My cosigning authors and I discovered some anomalies, which were confirmed through an investigation by @ASNAmNat https://journals.uchicago.edu/doi/10.1086/712483

    Copy/pasted from https://www.journals.uchicago.edu/doi/10.1086/712483

    “Retraction

    James L. L. Lichtenstein, Ambika Kamath, Sarah Bengston, and Leticia Avilés

    The article “Female-Biased Sex Ratios Increase Colony Survival and Reproductive Output in the Spider Anelosimus studiosus,” published in the November 2018 issue (pp. 552–563), is retracted by request of The American Naturalist and the authors of this retraction statement. Investigation by the journal, which was fully examined by all authors, revealed a 43-row repeated sequence in the sex ratio estimation data and nearly identical data distributions in colony size across collection sites. These anomalies in data collected by Jonathan N. Pruitt could not be explained, undermining the credibility of the article’s results. On this basis, the authors of this statement regretfully retract this article.

    Electronically published November 18, 2020”

    Jonathan Pruitt is not mentioned in this list of authors of this retraction note. Does this imply that Jonathan Pruitt does not agree with this retraction?

    Like

  18. Klaas van Dijk

    Copy/pasted from https://royalsocietypublishing.org/doi/10.1098/rsbl.2020.0588

    “Retraction:
    Behaviour, morphology and microhabitat use: what drives individual niche variation?
    Raul Costa-Pereira and Jonathan Pruitt
    Published:26 August 2020 https://doi.org/10.1098/rsbl.2020.0588
    See original article (Biol. Lett.15, 20190266. (Published online 5 June 2019) (doi:10.1098/rsbl.2019.0266 )

    Following publication of the article by Costa-Pereira & Pruitt, the journal was made aware of potential problems with the associated raw data collected by Pruitt J. An investigation into the data detected duplicated values and as such, results drawn from these data cannot be considered reliable and the journal has decided to retract the article accordingly.

    Biology Letters / © 2020 The Royal Society / Published by the Royal Society. All rights reserved.”

    Like

  19. Klaas van Dijk

    Copy/pasted from https://www.journals.uchicago.edu/doi/10.1086/713144 : “The article “Iterative Evolution of Increased Behavioral Variation Characterizes the Transition to Sociality in Spiders and Proves Advantageous,” published in the October 2012 issue of The American Naturalist (pp. 496–510), is retracted by The American Naturalist and the authors of this retraction. Investigation by the journal, which was fully examined by all authors, revealed duplicate ratios of spider mass to prey mass, which was often an exact value (e.g., 0.3, followed by 0.33 and 0.34 exactly). Similar anomalies, with redundant predator-prey mass ratios, were also found for other size categories. These anomalies in data collection by Jonathan N. Pruitt could not be explained, undermining the credibility of the article’s results. On this basis, the authors of this statement regretfully retract the article. (…). Published online January 19, 2021”.

    Christopher Oufiero, Leticia Avilés and Susan Riechert are listed as authors of this retraction note.

    Like

  20. Klaas van Dijk

    Copy/pasted from a thread at https://twitter.com/Niku_DiRienzo/status/1357369889489494017 (posted on 4 February 2021):

    “Figure I’ll give a one year #pruittdata #pruittgate update on two papers in BES: Lichtenstein et al 2016 and Pruitt et al 2011. Myself and coauthors requested retractions for them on 1/27/20 and 2/3/20 respectively after finding major data anomalies in both. I documented those issues here: https://pubpeer.com/publications/1084FEB633DD7021D4D5A954AACFBA , and here: https://pubpeer.com/publications/2BE2F156C5AC7C0C33939A612FDD9C

    The BES editorial board was initially highly concerned and recognized the issues. After extensive back and forth between myself/coauthors and Jonathan Pruitt about the anomalies in Pruitt-collected data, the board agreed to retract the papers on 09/04/2020. Since then I have written monthly check-in emails asking the status of the retraction statement. The typical response has been that this process is largely out of their hands. On 01/13/2021 the retraction text was sent to coauthors for approval, asking for a response by 01/20/21.

    Note that this text was only for Lichtenstein et al 2016, and not Pruitt et al 2011. When I asked about the 2011 retraction text I again received a response that it’s out of their hands at this point. Again, Pruitt et al 2011 had a formula in it to generate a new observation. Additional issues were single-cell differences between the two versions. Pruitt’s responses varied from the formula being used in a ‘sensitivity analysis’, to it being a different measure and not being used in the analysis despite stats/versions indicating otherwise. So even with these major issues which indicate clear manipulation the papers are still not retracted. Personally, I don’t think this is the fault of BES (they’ve been very tolerant of my pestering), but rather processes, foot dragging, and maybe legal actions at higher levels.

    Still, as someone who is out of the behavior field and leaving academia shortly, this whole process seems broken. I’m still shocked that ProcB allowed for us to retract our names and allow a shoddy reanalysis of clearly problematic data vs. just retracting the paper…

    I’m shocked that I have my name on two papers here that have obvious data issues with no valid excuse given in a year, yet they’re still out there. I get these things take time, but at some point the actual issue and solution is just being avoided.”

    Niels Dingemans responded on the same day: “On a positive note, journals not making a stand expose themselves as failing to protect the values we hold dear as a scientific community. That’s not bad: it helps us identify them, pushing us to refocus our editorial/reviewer time towards those that do protect our interests..”

    Source: https://twitter.com/DingemanseLab/status/1357410083064455174

    See https://www.springer.com/journal/265 for backgrounds about the journal in question. Lots of Associate Editors are affiliated to this journal, see https://www.springer.com/journal/265/editors How comes that these Associate Editors are unable to solve this issue?

    Like

Leave a comment