Building trust in media
CiteIt is developing new digital tools that help combat misinformation and selective quotations. These tools show the context surrounding the quoted media in order to build trust and understanding.
CiteIt is developing new digital tools that help combat misinformation and selective quotations. These tools show the context surrounding the quoted media in order to build trust and understanding.
Often when I’m reading a story in print, I come across a quote that makes me wonder: What was said in the sentences preceding or following the quote? In other words: What is the Context? Is this quote cherry-picked and can I really get a full sense for what happened from just the quoted selection?
CiteIt.net is the name I chose for a web service that allows writers to augment their writing with greater context about the quotations they make. The concept behind CiteIt.net is part of the tradition that started with footnotes, became hypercharged with hypertext links, and now has evolved to enable the words of the original sources to flow into the citing document, without requiring the reader to interrupt their reading experience by leaving the original document.
CiteIt.net was inspired by the work of Ted Nelson — who coined the term “hypertext” in 1962 and had the vision for a universal hypertext network, long before Bill Gates and Steve Jobs had even assembled their first personal computers.
While writing a review of Ted Nelson’s 2002 Ph.D. thesis, I was inspired to develop a way to write in a way more similar to what Nelson advocates. Ted’s design — Xanadu — features parallel texts that provide the full context of their sources. My design is a crude semblance of Xanadu, displaying only one pane of text in which the text surrounding a citation is “injected”. Because I do not have the technical ability to implement Ted’s vision and so much existing writing is available on the web as Html, I thought I could pursue Ted’s vision best by creating a quick-and-dirty extension to Html that would allow writers to start to create a collection of citation data. There is a saying from the 1989 movie “Field of Dreams —
If you build it –
– he will come. My approach is a data-centric one, that prioritizes the collection of citation data as a foundation that attracts others to the task of building a better version of hypertext. As the system develops, and as authors get used to the concept of pulling their quote context from the original source, it is my hope that the compilation of a public data set of citation data will attract great programmers to gradually rebuild hypertext, replacing my current Html cludge with advanced front and backend features.
–Tim Langeman (home page)
Akron, PA (USA)
Can you locate the source of any quote
When did you begin CiteIt?
Why is the context cut off mid-word?
Why did you release this as open-source software under a free license like the MIT license?
Is there a simple code demonstration of CiteIt I can inspect that is stripped of all the WordPress boilerplate?
What was Neotext?
If you have ideas or think you can help, send me an email. (email address found on homepage)
CiteIt has benefitted from being able to build upon the shoulders of the open source community. Here are a few of the people and projects that have had a significant influence on me or that CiteIt uses.
My long-term goal is to work on CiteIt full-time, ideally as part of an existing non-profit, such as the Internet Archive. The Internet Archive gets funding from public donations, as well as foundations. I've done a little bit of research into funding from foundations and the Knight foundation and Sloan foundation look like they might be a good fit because the Knight Foundation has partnerships with journalists with whom I would like to partner and the Sloan Foundation has funded work into the Universal Access to Knowledge and improving Wikipedia's accuracy and credibility . If you have ideas about organizations and funders that would be a good fit send me an email.
I've started to outline technical plans for the project (Google Sheets).
View Extended Plan Details (Google Sheets)(The stages of these plans do not line up exactly with the "phases" described below.)
Here are a few organizations/categories, I'm intestested in partnering with:
The first phase of the project involves providing context for authors, through expanding blockquotes or contextual popups..
I think that Judaism has the same problem that any thick civilization has in a world in which, as you say, context is stripped away. And not only is context stripped away, but attention to any one thing is scanter and less than it used to be. So, for example, a lot of Jewish commentary is based on your recognizing the reference that I make. Who recognizes references anymore? Because people don’t spend years studying books.
The project's path, depends upon which groups choose to adopt CiteIt first.
The python version of the webservice currently supports using digital PDFs as sources but the Docker-based public webservice does not yet support PDFs. Adding PDF support, both “digital” and scanned, to the Docker image is important to academic disciplines like history, as well as discussions of public affairs that cite many government documents.
There is a draft version of CiteIt which allows citing YouTube transcripts. More could be done to enable audio and video contextual citations.
A lot of work in the initial phase will need to be done to develop a full feature set and improve robustness. It is one thing to develop a proof-of-concept and quite another to develop a mature, production-tested service.
If CiteIt remains soley a citation app, I will consider it a success. The vision for further development is more speculative.
Get in touch with me if you want to start a conversation about the possibilities.
Authors’ citations could be added to a database when the authors index their pages. This database could be made available to aggregation services.
The second phase of the project involves taking advantage of the data collected from authors who use CiteIt to pull context into their webpages.
Citation data is of enormous value, as Google’s PageRank has shown. Adding source urls to specific quotations allows citations to become more granular and adding tagging and meta data allows citations to become more expressive and informative.
This database could be published using an API and programmers could develop competitive services to offer annotation, moderation, and search aggregation services.
Many people now recognize the value that links play in Google's calculation of the Page Rank algorithm which ranks search engine results.
CiteIt extends linking at a more granular level, enabling blockquotes and q-tags to link to a specific portion of a page.
Second Order Effect refers to the idea that every action has a consequence, and each consequence has a subsequent consequence.
Many people now recognize the second order effect of collecting cell phone location data allows Google to render a report of traffic speed.
Google Maps Traffic: measure driver's speed, from cell phone location.
The second order effect of collecting quote citation data is that it opens up the possibility of displaying the intersection of two documents at a granular level.
Just like cell phone location data enables Google traffic reports, linked quotations could enable web developers to design creative interfaces to display the quotes that intersect with specific poritions of a webpage.
Although a few sites have experiemented with inline comments, we are used to most comments coming at the end of an article (if a publication has enabled comments).
CiteIt would enable programmers to develop systems that could be like a cross of Google News meets inline comments.
Tagging could be added to quote tags and these tags could be used to moderate the linked content.
<blockquote cite="https://gimletmedia.com/shows/reply-all/o2hx34/127-the-crime-machine-part-i" data-citeit-cited-audio_url='https://soundcloud.com/replyall/127-the-crime-machine-part-i' data-citeit-cited-audio_start_time='157' data-citeit-cited-transcript_url='https://gimletmedia.com/shows/reply-all/o2hx34/127-the-crime-machine-part-i' data-citeit-citing-tags='reply-all-podcast, police, crime, crime-history, new-york-city, comsats, jack-maple' data-citeit-cited-title='The Crime Machine, Part I, Episode 127' data-citeit-cited-authors_names='PJ Vogt' data-citeit-cited-authors_twitter='@PJVogt' data-citeit-cited-authors_entities='Reply-All' data-citeit-cited-authors_entities_urls = "https://en.wikipedia.org/wiki/Reply_All_(podcast)" data-citeit-cited-series = 'Reply-All' data-citeit-cited-embed-url = 'https://player.gimletmedia.com:443/o2hx34' data-citeit-cited-series-url="https://gimletmedia.com/shows/reply-all/episodes" data-citeit-cited-series-youtube="https://www.youtube.com/playlist?list=PLS8aEHTqDvpInY9kslUOOK8Qj_-B91o_W" data-citeit-cited-event_date='2018-10-12' > (See more about Html custom-data attributes)
I would like to make the CiteIt database freely available so that a broad array of people to analyze it and develop aggregators.
I would hope that a new generation of search engines (or aggregators) could locate the top ranked citation comments associated with citation fragmnents and develop innovative user interfaces.
One desirable feature would be to classify and group quotes of a particular type together and display the top quotes for each group.
Surfacing citations from accross the internet has the potential to reduce the polarization that results when individuals cluster into their own homogenous groups (silos).
If CiteIt develops as a community, work should be done to intentially build Civil Conversations, as healthy communities take time and intentionality to develop.
Ultimately, if CiteIt were to succeed as a concept, it seems like the service might benefit from being built as a decentralized system that archives and verifies the original context of quotes. This would provide readers with greater trust in the data and defend against attempts at censorship.