Showing posts with label quality content. Show all posts
Showing posts with label quality content. Show all posts

Wednesday, February 22, 2012

Are you plagiarizing?

Just because you're not copying someone else's work exactly as written, doesn't mean that you're not guilty of plagiarism. As writers who often rely on other sources for facts pertinent to our own news reporting, we must have an accurate understanding of exactly what is and what isn't plagiarism. Rather than relying on my own interpretation of the term plagiarism, let's look at a couple of independent sources.

The Instrument of Judicial Governance from the University of North Carolina defines plagiarism as "deliberate or reckless representation of another's words, thoughts, or ideas as one's own without attribution..." I added the bold for emphasis.

USLegal.com defines the term as follows:
"The intentional or unintentional use of another's words or ideas without acknowledging this use consitutes plagiarism..." Simply enough, right? As long as you, at some point in your article say what you've 'borrowed,' you're covered, right? Wrong.

USlegal.com goes on to say: "There are four common forms of plagiarism:
  • Duplication of another author's words without quotation marks and accurate references or footnotes.
  • The duplication of another author's words or phrases with footnotes or accurate references, but without quotation marks.
  • The use of another author's ideas in paraphrase without accurate references or footnotes.
  • Submitting a paper in which exact words are merely rearranged even though footnoted."
Again the bold is mine for emphasis of certain points. Let's go over these four forms of plagiarism, one by one. Then, I'll give four examples of published news articles reporting the same event to help understand how to follow these points in practice.

The first of these four, I think, is pretty clearly understood. Copying another person's work without crediting them is wrong and constitutes plagiarism. Few would have issue with that.

The second may be less widely known. Most might say, ok, I forgot quotation marks (or didn't know I needed them), but I correctly credited the source of the material I used, so it isn't plagiarism. Not quite, the purpose of the quotation marks is to set aside the borrowed material, so that readers know exactly what was borrowed from your cited source and what is your own creation. In the age of Internet, it may be permissible to use other forms to delineate exactly what was taken.

An indented bit of text, with a different background color, font, or size that is intentionally made to appear as though it were cut and pasted from another source, could, arguably substitute for quotation marks, so long as the source is properly attributed, for example.

The exact beginning and end of the quoted text should, however, be made unambiguously clear to the reader. Without quotation marks, the reader cannot tell what part of your writing is yours and what, exactly, is someone else's creation.

The third bullet point is where I, unfortunately, see either much ignorance, confusion or disregard for the definition of plagiarism. It is plagiarism to read an article written by someone else, and rewrite it in your own words, without fully, specifically, and properly citing that source, according to the USLegal.com definition. Reading a New York Times report that says:
"Among the more than 80 people that activist groups reported killed by rockets and bombs through the day, two were Western journalists, the veteran American war correspondent Marie Colvin, who had been working for The Sunday Times of London, and a young French photographer, Rémi Ochlik." Does not give you liberty to write in your article: "A French photographer and a Sunday Times of London reporter were killed during fighting in Syria," without directly citing the source of that specific piece of information. Furthermore, citing the NYT article once at the beginning of your piece does not give you leave to paraphrase the entire article or major sections of it, or to paraphrase it in multiple places throughout your article without further attribution. For example, on the first page of the above-referenced NYT piece, you'll find three specific references to information that the authors gleaned from one or more Reuters reports, as well as other citations for other facts and statements that they used.

Lastly, the fourth bullet of the USLegal.com definition tells us that we can't shirk our responibility to use quotation marks and specific attribution, just by rearraging the words. If some wrote "The police apprehended the suspect at 4:30 pm, on Saturday." You can't simply write "On Saturday, at 4:30 pm, police apprehended the suspect," without quotation marks and accurate references or footnotes. That still represents the original author's idea and words, not yours.

Here are examples of the same event reported by four separate sources.
1) WMUR report of a harp seal on a Hampton, New Hampshire beach published at 2:33 pm on February 19th
2) A report of the same event published at Patch.com published on February 20th
3) My own report of the event at Examiner.com published late in the day on February 19th 
4) A report from Foster's Daily Democrat published on February 21st

(Note: Go read them, but don't leave nasty comments accusing anyone of anything, please. My purpose in writing this isn't to call anybody out, but to educate based on some convenient examples that I had on hand.)

Of these four, the WMUR report was the first published. I was at the scene and have first-hand knowledge that residents attending the event contacted WMUR. From the wording of the WMUR report, it also appears that the author also contacted The New England Aquarium for additional information although they did not specify that source by name.

The second example needs a bit more scrutiny. The first paragraph, appears fine from a palgiarism perspective. There's a sentence containing two factual statements and a proper attribution to the WMUR article that I used as example #1. Note, however, that the Patch.com article says the seal "washed up" on the beach, which is a factually inaccurate assumption by the author based on the WMUR report that  the seal "was out of the water."

The first sentence of the next paragraph at Patch.com violates the second bullet point of the USLegal.com definition of plagiarism. It is a word for word copy of the statement made by WMUR without quotation marks, but properly referenced. This gives the impression that the Patch.com author created more of that sentence's wording than "the television station said." The rest of the Patch.com article's second paragraph is a combination of direct quotes and paraphrasing of the WMUR article's third paragraph. The Patch.com author has done nothing original here except to change one perfectly good WMUR sentence into a fragment, and change the source of the pronouncement of a healthy diagnosis that WMUR attributed to The New England Aquarium, to the volunteer (from the Blue Ocean Society) who never, in fact, spoke to WMUR before the WMUR article was initially published.

The first sentence of the third paragraph of the Patch.com article is lifted directly from the WMUR article's second paragraph although the Patch.com "author" changed the word "common" from WMUR's report to "not unusual" in his. He also dropped the words "when they want sleep or" from the WMUR report, otherwise copying their sentence verbatim. There is no attribution at all for the ideas or the copied words in the third paragraph of the Patch.com article. The next Patch.com "sentence," (I use the term lightly, since it is actually a sentence fragment) is another assumption by the author guessing at the emotional state of the onlookers that WMUR said were present. The remainining sentence and a half of the Patch.com article are original, linking two earlier articles by the same author on related topics (although I can't vouch for the originality of those two linked articles).

Plagiarism? According to the USLegal definition, unequivocably. Was the author aware that he was plagiarizing? Given the grammatical errors in the article, one could argue that he was unaware of his crime, which, given USLegal's "intentional or unintentional" verbiage, would make no difference in court, by the way.

The third example is one that I wrote published after the WMUR report. It however, contains much information not contained in the WMUR report, based on my personal knowledge and first-hand reporting, as I personally witnessed the events and spoke directly to the volunteer from the Blue Ocean Society who was alerted to the seal's presence by the New England Aquarium. There are no phrases or ideas lifted from the WMUR report. Paraphrased information from The New England Aquarium is properly cited. The facts in the final paragraph were written from my own knowledge, but since they may not be common knowledge to a majority of readers, in retrospect, they should have been cited from some independent source, to avoid any appearance of impropriety.

The fourth article, from Foster's Daily Democrat, was written by someone who was clearly not present at the location. It is quite possible that the author of this article was alerted to the situation by reading one of the previously published articles, but they contacted original sources, spoke to a named individual at The New England Aquarium extensively as well as the Blue Ocean Society volunteer. Although this article uses sources common to other articles, the New England Aquarium and Patty Adell, it is clearly original. The Foster's reporter may (or may not) have gotten the idea for the story from another published source, but he did his own reporting and did not simply rehash any existing articles that he may have read. 

You Don't Have to be On the Scene to Write About It

These examples consist two articles created by people involved directly at the time the reported event was occurring (one at the scene, one by telephone), and two created by people who apparently learned of the event afterward. You don't have to be at the scene to write about an event, but you do have to do more than repeat what someone else has already written, whether or not you paraphrase their exact words - even if you cite your source.

If you create even a properly attributed article based on other published sources and every sentence refers to another source, then, quite aside from issues of palgiarism, you should ask yourself exactly what you are bringing to the table. What are you offering the reader?

It might be a unique analysis of the events, a comparison of the event's coverage by several different sources, additional information not contained in previous reports, a more complete report than is found elsewhere by amalgamating information from a variety of sources, or anything else that makes your article uniquely your own and of unique value to the reader rather than a simple rewrite of someone else's work. Even if your editor allows simple rewrites to slide by, it's not your work and you have no business publishing it to steal traffic from the original source.

Sunday, October 9, 2011

Good Journalistic Advice from the Howard 100 News Team

For those with satellite radio subscriptions who listen to the Howard Stern Show, the Howard 100 News team actually gives a solid piece of journalistic advice, albeit in a humorous fashion. The tag line they use in one of their promos is "If you mother says she loves you, check her sources."

While I take it on faith that my mother loves me, after all she is giving first hand testimony to that fact, when it comes to news-worthy leads, I check everything. Recently, for example, It was suggested that I write a news story about October 5th being the most popular birthday in the USA by the News Director of the Yahoo! Contributor Network. He generally provides me with pretty solid leads, sometimes offering me contact info. for specific news-worthy leads and suggesting article topics that he'd like to pitch to the Yahoo! News front page folks.

Sometimes they pan out and sometimes they don't (and sometimes they require extraordinary effort on my part: "If you can find someone who was present at 'X' event 60 years ago, we'd like to publish an interview with them."). That's great, I am thankful and lucky to have such a good relationship with the person who decides whether to pay me for an article. I pursue the leads offered to the best of my ability whether I think they are likely to be profitable to me or not, often investing many hours in research before I really know whether there is a story there or not.

The October 5th birthday story was no different. I accepted the challenge and went to work, assuming that the premise was true. Indeed, at first it seemed that every web reference agreed. October 5th is the most popular date for birthdays in the USA because it coincides with a New Year's Eve conception, they said. Hmmm. I was born on October 2nd, which is certainly within the big part of the bell curve for those babies conceived on the same date as those born on October 5th. So I asked my mother, "Is it likely that I was conceived on New Year's Eve?"

"Yes," she replied that she was almost certain of that.

I noticed, however, that one source was cited by most of the websites and news stations which were reporting October 5th as the most popular birthday. Those that didn't cite a specific source had specific language or numbers that were too similar to those cited by that same, singular source to be coincidental.

Furthermore, the data upon which that original source based its claim was unavailable. It would have been easy to cite the same source used by such news outlets as NBC4-TV out of Washington, D.C., and all the others, but one source without the backing evidence just doesn't pass my standard of reliability even though that was the storyline requested by the News Director. So I asked myself, who might have actual birth record data and statistics? The government, for one. So I did a search restricted to .gov  sites and came up with the CDC data contained in my report which was inconclusive but tended to cast further doubt on the October 5th meme.

Looking further, I found a 2006 study by a Harvard professor who listed all 366 days of the year (including leap year's February 29th) in order of popularity, based on a study of many years of birth records. His data was more in line with the CDC data and was quite different than the source used by almost every website that talked about the most popular birthday in the U.S. I could have, and should have, gone one step further and called the Harvard professor who did the study and asked to see his data for a solid answer to the question, but by this time, my deadline was fast approaching and it didn't seem possible to make the contact and review the details of the study in time to publish the story at all.

With three sources in hand, but without the raw data to confirm their findings, I cannot say definitively upon which date the most popular birthday falls. I can say, however, that the October 5th date is pretty doubtful, and that's the story I went with. You can read it in its entirety at Yahoo! News by clicking the link below.

Tuesday, June 21, 2011

When to use your prior articles as primary source material

I've covered the difference between primary and secondary sources in previous posts and told you that you should always strive to use a primary source rather than someone else's reporting of that source material. Obviously, an original interview that you conduct is a primary source. Don't overlook that fact if you later write on a similar topic. You can cite the expert you interviewed with a link back to that interview as a primary source for future articles.

This is an exception to the rule. Ordinarily, I think linking back to one of your articles as a source of factual information for a news report is not a good idea if the information you're citing was originally drawn from another source. In other words, many articles (especially news articles) are themselves secondary or even tertiary sources.

If your previous article said, "The FBI, on their website, are asking for the public's aid in locating Whitey Bolger." That's a secondary source with the FBI website being the primary source. If you wrote "ABC News reported that the FBI Director asked for the public's help..." Your article becomes a tertiary source, ABC News is the secondary source for the primary source which was the FBI Director himself. These kinds of chain references where the reader who wishes to check your source has to go through several links to get to the source are damaging to the writer's credibility, in my opinion, and should be avoided.

However, articles in which you personally conducted the interview and are bearing witness to what the relevent source said to you, are fair primary source references. There is no extra link in the chain for the reader to chase down because you've done the original investigation and reporting and these articles can credibly be used as primary source references for future articles.

Note that this is for news source references. If you include links to other articles you've written as additional background material, further reading, related topics, or also of interest pieces, that's a separate thing altogether and I don't take issue with that at all.

Monday, May 23, 2011

Related blog post: Interviews

I just wanted to drop a quick note here that I have another blog called "Interviews with Experts." It is mainly dedicated to collecting all the interviews I have done with experts on various topics, but it also includes posts which discuss issues that have come up in the course of conducting interviews and my experiences around the interviews themselves. Since interviews are a key news data gathering tool, I am mentioning it here for those that are interested, but I'm generally not going to cross-post every entry from there to here.

Friday, March 25, 2011

What keyword density should I use?

Originally Associated Content told us to use 2-5% keyword density (or even 3-6% at one point). There was even a general town hall conducted with that advice. They suggested using bolded subheadings to get more keyword reiterations in without making the article sound awkward. That worked, and I did it for a while, but it often made the article difficult to read.

Using synonyms and words that are highly related to the topic works pretty well, too, and doesn't lead to the same sort of awkward constructions. Personally, I don't measure keyword density at all anymore, and I don't target keyword density as a goal at all. I focus on writing something that I would want to read. To me, that means that there is real content in the article, and it doesn't make my head hurt to read it.

Nothing ticks me off more than clicking an article titled something like "Obama Birth Certificate Update" and finding an article that says little more than:
"The Obama birth certificate is in the news again.
One of the top ten Google trends this morning was the Obama birth certificate as controversy continues to swirl around this topic. When first elected, many Obama opponents mounted a public information campaign to try to convince people that Obama was not eligible to be president because he could not produce the long form of the Obama Birth Certificate..."
and so on, with absolutely no new information.

As a reader, I expect (or at least hope) to learn something new every time I read a news article. If I can't find something to give the reader in exchange for clicking my clever article title, then I don't write it. Even when I was chasing keywords, I tried to provide some useful content. This example of one of my early AC News articles (when I was using keyword density measures as requested by the AC news department) is still drawing good page views every month after more than two years: It is noticeably keyword heavy, but is still readable.

It scored well on Google search results when first published, but with all the keyword spam competition for the term since then it is nowhere near the top at present. So how are people still finding it? I can find incoming links from a number of quality medical information sites referencing my article. It is not drawing page views because of the keyword density, but because it provides solid, authoritatively sourced information.

In my opinion, the latter strategy will be more effective for news writers in driving both short and long term page views under the new model.

Writing a very high quality, short, targeted piece for the Yahoo! Contributor Network (Y!CN) is much more likely to get picked up by Yahoo! News. That is where we can score really big page views. I have seen a few articles there pick up about 300,000 page views in just a couple days, and many more articles that hit 75,000 to 100,000 or more from a number of Y!CN writers.

My most popular Yahoo! News article to date was not on Google Trends at all when I wrote it, although its main topic did appeared there later. (I'll discuss the sources I use for breaking news story ideas in another post on another day.) It was also linked and tweeted by The Drudge Report and a number of other popular sites, and given a featured position by Yahoo! News. It had more than 2900 Facebook shares, and hundreds of Tweets. That will likely NEVER happen with something like the Obama birth certificate article I described above.

I think the paradigm for attracting eyeballs to AC News (and now Y!CN) articles has shifted, and, in my opinion shifted for the better, with the popularity of social media platforms like Facebook, with the Yahoo! acquisition of Associated Content (AC), and again with Google's latest algorithm change. Consistent creation of really useful material (from a reader's viewpoint) results in more opportunities, especially under the Yahoo! regime. With the old keyword dense articles, I averaged something like 2500-3000/ article over my entire library when I was writing several such news articles each day. I made the top 100 writers for AC for the year 2008 with just six months of membership and a library of about 190 articles in total.

Having stopped chasing keywords and Google Trends and starting to concentrate on delivering consistently higher quality, I now have an assignment desk so full of recurring article requests for publication at Yahoo! News that each come with guaranteed upfront payments in excess of my average per article earnings for the old keyword heavy news articles. That's without a single page view, and without the various miscellaneous one-off requests that come through. For me, the math is pretty clear. In short, Yahoo! and Y!CN want higher quality news content with a much lesser (if any) focus on keyword density and, most importantly, they are willing to pay for it.