Army intell wants to scan social media from 40 countries
The Intelligence and Security Command is looking for the ability to collect and analyze huge data sets of social media and open-source information in up to 66 languages.
The Army wants to be able to anonymously scan social media platforms and open-source information from up to 40 countries and in 66 languages, and perform big data analytics on huge data sets of the information in search of trends in political, military, economic and other areas. And it wants to be able to do it from a smartphone.
In a sources sought notice published at FedBizOpps.gov, the Army Intelligence and Security Command (INSCOM) said it “anticipates the need” for a non-attributable (anonymous) service that will collect and analyze the data, and allow INSCOM personnel to perform their own analyses using customized big data tools.
Intelligence agencies, like businesses and political campaigns, recognize the value of social media in track trends, public sentiment and the kind of emerging public uprisings that took place during the Arab Spring. Agencies from the Homeland Security Department to the Defense Advanced Research Projects Agency have looked to use social media analytics for signs of terrorism or as a conduit during emergencies. The challenges have included the size of the data and the fractured language used on the likes of Twitter and Facebook.
INSCOM’s notice said it’s looking for tool that can perform analytics on data from a changeable list of up to 40 countries and follow up to 10 analytical themes, including “political, military, economic, social, infrastructure, and information systems of foreign states,” as well as perform sentiment analysis and predictive analytics. The service should be available to INSCOM’s entire enterprise via smartphones or tablets, giving access to users anywhere in the field, the notice said.
Other features the Army is looking for include:
Automated reporting interfaces with additional focused-analytics as needed.
- The ability to search foreign social media and open source information and conduct predictive analysis, sentiment analysis and deliver situational awareness.
- Node clustering of big data utilizing a distributed, scalable and portable file-system and a software framework that allows for data-intensive applications.
- A non-intrusive framework for processing parallelizable problems across huge datasets using a large number of clusters on all ingested data sources.
In addition to supporting basic machine language translation for 66 languages, the tools also should support synonym translation for Pashto, and Arabic, and Urdu.
The languages INSCOM wants to translate: Afrikaans, Albanian, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Kannada, Khmer, Korean, Lao, Latin, Latvian, Lithuanian, Macedonian, Malay, Maltese, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh, Yiddish.
Responses are due by Dec. 23 via email to Courtney Webber at Courtney.j.webber.civ@mail.mil.