What’s the point: Google AI, JetBrains’ big data tools, Subversion, Docker

what’s the point: google ai, jetbrains’ big data tools, subversion, docker


Google’s AI teams used the last days of May to share their advances in natural language generation evaluation and demonstrate how the large-scale pre-training approach currently hot in the language domain could be used in the field of computer vision. 

The idea behind the latter boils down to pre-training general features with a variety of datasets and fine-tuning the resulting model with less data for a task of interest in a second step. The exact methodology is explained in “Big Transfer (BiT): General Visual Representation Learning” with models and notebooks available via a project repository.

Meanwhile the language team has been busy using pre-training to find a way of measuring the quality of systems generating natural language. The result is called BLEURT, a “learned evaluation metric based on BERT that can model human judgments with a few thousand possibly biased training examples”. The examples mentioned stem from public rating collections and additional user input.

BLEURT is meant to combine advantages of human evaluation and automatic metrics, while being more performant than competing approaches. The first results seem to compare well enough to yield further investigation, with the team staging to look into multilingualism and multimodality next.

JetBrains opens Big …



Looking Inside A Big Data Toolbox

looking inside a big data toolbox


Getting the meat of any data pool into burger-shaped perfection ready for consumption requires a few … [+] tools.

Wikimedia Commons

Much like term super, the big in big data comes with a certain amount of hype. Just as we now have supercars, supermodels, superspreaders and super-sized meals, we now have big business, big data and of course Big Macs.
Regardless of the hype cycle, big data has firmly entered our tech-business vocabulary. We now use it as a kind of blanket term when we talk about the massive web-scale information streams being passed over the cloud, inside the Internet of Things (IoT) and throughout the new realms of Artificial Intelligence (AI).
Broadly meant to refer to an amount of data that is too large to fit comfortably or productively into anything that resembles a ‘traditional’ relational database management system, big data is still just data… but it includes core operational enterprise data plus all the pieces of information that an organization knows it has, but is perhaps yet to act upon.

No-code wrench and spanner in the big data toolbox
To wrangle our way through the mire of big data, an increasing number of software companies are getting into the big …


READ MORE FROM SOURCE ARTICLE Continue reading “Looking Inside A Big Data Toolbox”

Splunk Adds To Big Data Platform’s Cloud, Machine Learning Capabilities

splunk adds to big data platform’s cloud, machine learning capabilities


Splunk is extending the cloud and machine learning functionality of its big data platform and leveraging a new strategic partnership with Google Cloud to continue the company’s push for cloud delivery of its software.
Splunk’s goal is to have its cloud-based products, including Splunk Cloud, Splunk SignalFx and VictorOps, account for 60 percent of the company’s sales in two years, up from the current 30 percent, said Sendur Sellakumar, Splunk chief product officer and senior vice president and general manager of cloud.
“We have shifted our business to be a cloud-first delivery,” Sellakumar said in an interview CRN.

[Related: Splunk Switch To Subscription Model Almost Complete, Reports 27 Percent Q4 Growth]

While many of Splunk’s products now run on cloud platforms and are sold on a subscription basis, some, including the Splunk Phantom security orchestration and automation software and Splunk User Behavior Analytics, are still sold using traditional licenses and for on-premises use.
Under the partnership with Google announced earlier this month, the Splunk Cloud machine data platform service will run on the Google Cloud Platform. Currently in beta, that is expected to be generally available later this summer. Splunk Cloud already runs on Amazon Web Services and the …


READ MORE FROM SOURCE ARTICLE Continue reading “Splunk Adds To Big Data Platform’s Cloud, Machine Learning Capabilities”

Three Powerful Benefits of Big Data

three powerful benefits of big data


Big data gives some astonishing advantages to a wide range of organizations over the globe. From the education sector to the healthcare industry, pretty much every industry is currently bound to big data analytics in a few or the other way. From the earliest starting point of time until 2003, the whole world just had five billion gigabytes of data. A similar amount of data was produced over just two days in 2011. By 2013, this volume was created every 10 minutes. It is, hence, to be expected that a generation of 90% of the considerable amount of data on the planet has been in the previous few years.
The speed at which information is streamed, these days, is uncommon, making it hard to manage it in a timely fashion. Smart metering, sensors, and RFID labels make it important to manage data torrents in practically real-time. Most companies are thinking that it’s hard to respond to data rapidly.

Better Decision Making
The primary advantage of utilizing Big Data Analytics is that it has helped the decision-making process to a great extent. Instead of anonymously deciding, organizations are thinking about Big Data Analytics before closing to any decision. A variety of customer-centric factors like what …


READ MORE FROM SOURCE ARTICLE Continue reading “Three Powerful Benefits of Big Data”

How Big Data Analytics & AI Can Help Boost Bee Populations

how big data analytics & ai can help boost bee populations


The veracity of a popular claim – often attributed to Albert Einstein – that mankind can survive only four years if all bees in the world disappeared is difficult to address. But what is certain is that they are critical pollinators of crops whose demise would not augur well for humans.

Driven by this – or purely altruistic motives – several companies including Microsoft, Oracle and SAS, are employing big data analytics and AI to help boost the declining population of bees. While some are involved in innovative projects that supplement manual beehive inspections that can be time-consuming, others are using AI and visual analytics to develop smart beehives and apps to understand bees’ behaviour better.

Let us examine some of these initiatives and how they are using emerging technologies to boost the population of bees in the world.

SAS’ Use of Advanced Analytics & ML

With four beehives at its main campus in North Carolina, SAS is well-positioned to closely work with beekeepers to monitor real-time conditions of hives. Equipped with sensors, these hives are being deeply studied with the resultant auditory data and ML algorithms developed. Furthermore, SAS is working with Appalachian State University to create data visualisations on the world’s bee …


READ MORE FROM SOURCE ARTICLE Continue reading “How Big Data Analytics & AI Can Help Boost Bee Populations”

Scientists are collecting big data to expedite the fight against COVID-19

scientists are collecting big data to expedite the fight against covid-19


A nationwide collaboration of clinicians, informaticians and other biomedical researchers aims to turn data from hundreds of thousands of medical records from coronavirus patients into effective treatments and predictive analytical tools that could help lessen or end the global pandemic.

Through the National COVID Cohort Collaborative, about 60 clinical institutions affiliated with the National Institutes of Health-supported Clinical and Translational Science Awards Program are invited to partner with U.S. Department of Health & Human Services agencies and clinical organizations. Together, Collaborative members will support the analysis of electronic health records on a new, secure database.

The National COVID Cohort Collaborative is supported as part of a $25 million NIH award to the National Center for Data to Health, which is coordinating the collaborative’s efforts and is based at Oregon Health & Science University’s Oregon Clinical and Translational Research Institute.

NIH’s National Center for Advancing Translational Sciences, also known as NCATS, is providing overall stewardship of the Collaborative.

“There is no centralized health care data in the United States,” explained Melissa Haendel, Ph.D., the Collaborative’s lead investigator, National Center for Data to Health director, an associate professor of medical informatics and clinical epidemiology in the OHSU School of Medicine, and translational data science …


READ MORE FROM SOURCE ARTICLE Continue reading “Scientists are collecting big data to expedite the fight against COVID-19”

JetBrains joins the big data space with new tools for DataGrip and PyCharm – SD Times

jetbrains joins the big data space with new tools for datagrip and pycharm – sd times


JetBrains announced that Big Data Tools is now available as EAP for DataGrip and PyCharm Professional. The news aims to address problems that involve both code and data. 
The company first announced plans to support more big data tools last year when it announced a preview of the IntelliJ IDEA Ultimate plugin with Apache Zeppelin notebooks integration. Since the plugin started with only Scala support, it made sense to only make it available for IntelliJ IDEA Ultimate. But now that the team has added more support for a wider set of scenarios and tools, JetBrains felt it was time to extend out the capabilities and make it available to other IDEs.
“We believe the plugin will extend the capabilities of DataGrip users when it comes to working with distributed file storage systems and columnar file formats. At the same time, the users of PyCharm who use PySpark or who also work with data will benefit from having this plugin available in their IDE,” the team wrote in a post.
The current feature set includes a file browser for distributed file storage systems such as AWS S3, HDFS, and GCS, with support for other clouds such as Microsoft Azure in the …


READ MORE FROM SOURCE ARTICLE Continue reading “JetBrains joins the big data space with new tools for DataGrip and PyCharm – SD Times”

Crazy cheap Three SIM only deal delivers big data, 5G and unlimited calls/texts

crazy cheap three sim only deal delivers big data, 5g and unlimited calls/texts


Three has just turned on the SIM only deal afterburners, with this brand new SIMO package delivering unlimited calls, unlimited texts and 12GB of data for just £8 per month.And, what makes this deal even better is that it is a short 12-month. That’s a short contract length, big data and an incredibly low price.It gets even better than that, though. That’s because this deal also comes with free delivery, free 5G, free personal hotspot, and free roaming around the world.This is a proper SIM only deal bargain, and the full details of the deal can be viewed below: SIM only | Unlimited calls and texts | 12GB data | £8 a month | Contract length: 12 months | Available now at ThreeIf the idea of spending only £8 per month on your SIM plan appeals then this data-stuffed deal from Three is absolutely worth considering. It delivers 12GB of data to burn each month, as well as unlimited minutes and texts for just £8 per month. Free 5G, roaming and delivery is also included. Superb!View DealIf you like the sound of the £8 per month Three SIM only deal above but, actually, fancy picking up unlimited data in your next SIM upgrade then be sure to …


READ MORE FROM SOURCE ARTICLE Continue reading “Crazy cheap Three SIM only deal delivers big data, 5G and unlimited calls/texts”

How construction firms can capitalise on their data

how construction firms can capitalise on their data


You have data – here’s what to do with it, says Karthik Venkatasubramanian, vice-president of data and analytics at Oracle Construction & Engineering
Data continues to fascinate the construction industry. Where other industries are much more mature in terms of their ‘big data’ journey, in construction it’s still gaining traction. Construction is often considered being behind in its digitisation journey, but in this weakness lies a significant strength: the ability to innovate like never before. And the opportunities presented by technology are starting to justify the cost more than ever.
We’re at a stage where it’s challenging to understand how construction and engineering companies managed without what we now consider ‘big data,’ particularly considering the level of control, transparency, threat awareness, and accountability it provides. Today, the industry appears data-hungry with companies keen to know what data science – including artificial intelligence (AI) or machine learning (ML) – can do for them.
This clamour for innovative data solutions invites some obvious questions: have construction businesses ignored the data they already have? How can they capitalise on the sheer volume of data they want? Can they even cope with more data? But also, is more data really the solution?
Data blindness


READ MORE FROM SOURCE ARTICLE Continue reading “How construction firms can capitalise on their data”

Why to Consider Data Science in 2020?

why to consider data science in 2020?


Today, more and more organizations are leveraging big data and its potential as they are generating a voluminous amount of data every day. However, this is increasing the value of data science and the demand of data scientists that use scientific methods, processes, algorithms and systems to mine insights from both structured and unstructured data. There is no doubt data is an essential asset for every business, enabling executives and leaders to make effective decisions based on facts to bolster their productivity and profitability.
Considering a market report, the global market of big data is predicted to grow at a CAGR of 0.6 percent from US$138.9 billion in 2020 to US$229.4 billion by 2025. It has also been projected that the data science platform market size will grow at a CAGR of around 30 percent from US$37.9 billion in 2019 to US$140.9 billion by 2024.
Data Science plays a vital role in any business, supporting businesses to deliver germane products, assisting in minimizing peril and fraud in business, and helping business leaders to understand and manage their data effectively.
Already, most companies that use big data and analytics technologies into their business processes are benefiting from the growing democratization of data. With a large set of …


READ MORE FROM SOURCE ARTICLE Continue reading “Why to Consider Data Science in 2020?”