A data janitor, the sexiest job of the 21st century



A job invented in Silicon Valley is going mainstream as more industries try to gain an edge from big data.

The job description “data scientist” didn’t exist five years ago. No one advertised for an expert in data science, and you couldn’t go to school to specialize in the field. Today, companies are fighting to recruit these specialists, courses on how to become one are popping up at many universities, and the Harvard Business Review even proclaimed that data scientist is the “sexiest” job of the 21st century.

Data scientists take huge amounts of data and attempt to pull useful information out. The job combines statistics and programming to identify sometimes subtle factors that can have a big impact on a company’s bottom line, from whether a person will click on a certain type of ad to whether a new chemical will be toxic in the human body.

While Wall Street, Madison Avenue, and Detroit have always employed data jockeys to make sense of business statistics, the rise of this specialty reflects the massive expansion in the scope and variety of data now available in some industries, like those that collect data about customers on the Web. There’s more data than individual managers can wrap their minds around—too much of it, changing too fast, to be analyzed with traditional approaches.

As smartphones promise to become a new source of valuable data to retailers, for example, Walmart is competing to bring more data scientists on board and now advertises for dozens of open positions, including “Big Fast Data Engineer.” Sensors in factories and on industrial equipment are also delivering mountains of new data, leading General Electric to hire data scientists to analyze these feeds.

The term “data science” was coined in Silicon Valley in 2008 by two data analysts then working at LinkedIn and Facebook (see “What Facebook Knows”). Now many startups are basing their businesses on their ability to analyze large quantities of data—often from disparate sources. ZestFinance, for example, has a predictive model that uses hundreds of variables to determine whether a lender should offer high-risk credit. The underwriting risk it achieves is 40 percent lower than that borne by traditional lenders, says ZestFinance data scientist John Candido. “All data is credit data to us,” he says.

Data scientist has become a popular job title partly because it has helped pull together a growing number of haphazardly defined and overlapping job roles, says Jake Klamka, who runs a six-week fellowship to place PhDs from fields like math, astrophysics, and even neuroscience in such jobs. “We have anyone who works with a lot of data in their research,” Klamka says. “They need to know how to program, but they also have to have strong communications skills and curiosity.”

The best data scientists are defined as much by their creativity as by their code-writing prowess. The company Kaggle organizes contests where data scientists compete to find the best way to make sense of massive data sets (see “Startup Turns Data Crunching into a High-Stakes Sport”). Many of the top Kagglers (there are 88,000 registered on the site) come from fields like astrophysics or electrical engineering, says CEO Anthony Goldbloom. The top-ranked participant is an actuary in Singapore.

Universities are starting to respond to the job market’s needs. Stanford University plans to launch a data science master’s track in its statistics department, says department chair Guenther Walther. A dozen or so other programs have already been started at schools including Columbia University and the University of California, San Francisco. Cloudera, a company that sells software to process and organize large volumes of data, announced in April that it would work with seven universities to offer undergraduates professional training on how to work with “big data” technologies.

Cloudera’s education program director, Mark Morissey, says a skills shortage is looming and that “the market is not going to grow at the rate it currently wants to.” That has driven salaries up. In Silicon Valley, salaries for entry-level data scientists are around $110,000 to $120,000.

Others think the trend could create a new area of outsourcing. Shashi Godbole, a data scientist in Mumbai, India, who is ranked 20th on Kaggle’s scoreboard, recently completed a Kaggle-arranged hourly consulting gig, a new business the platform is getting into. He did work for a tiny health advocacy nonprofit located in Chicago and is now bidding on more jobs (he earns $200 per hour, and Kaggle collects $300 an hour). His Kaggle work is part time for now, but he says it’s possible that it could be his major source of income one day.

To the data scientists themselves, the job is certainly less sexy than it’s being made out to be. Josh Wills, a senior director of data science at Cloudera, says most of the time it involves cleaning up messy data—for example, by putting it in the right columns and sorting it.

“I’m a data janitor. That’s the sexiest job of the 21st century,” he says. “It’s very flattering, but it’s also a little baffling.”


Click here for the article on the web.



<< Click here to see more recent news articles >>

 

 

Harnham blog & news

With over 10 years experience working solely in the Data & Analytics sector our consultants are able to offer detailed insights into the industry.

Visit our News & Blogs portal or check out our recent posts below.

Bridging the Gap: The Role of a DevOps Engineer

Siloed teams are swiftly becoming a thing of the past as organizations learn collaboration is key. Businesses are embracing transformation. But some may not know where to turn to help them manage such a massive restructuring of operations. Enter the DevOps Engineer. Yes, Virginia. The unicorn employee does exist. What is a DevOps Engineer? For many businesses, it’s a dream to find a technical person who can also communicate across departments. In the DevOps Engineer role is an IT Generalist who not only has a deep understanding of codes, infrastructure management, and agile familiarity but who also possesses interpersonal skills. It’s this combination that makes this role so imperative to businesses. Working across siloes and bringing teams together for collaboration bridges the gap between the technical and non-technical departments. One of their most important roles is as advocate. Moving from siloed teams to the more collaborative environment of a DevOps culture can be difficult for engineering team members. But as advocate for the benefits, the DevOps Engineer can explain it best to those with whom they’ve worked. Their technical expertise puts them on par with their peers and their interpersonal skills offer a way to communicate across the organization.   Want to Restructure Your Skills toward DevOps? If you’re an IT Generalist with great communication skills. DevOps Engineer could be your next role. But what skills do you need and how might you streamline what you already know into this key role for many businesses? Technical skills depend on team structure, technologies in place, and tools already in use. But the key element of a DevOps Engineer is their strong communication and collaborative skills. Can you morph your technical world into layman’s terms for the executives? Can you translate different needs across teams from QA testers to software developers, generalists and specialists alike? It’s this deep understanding which makes you so valuable to employers. For many organizations, this is the best of both worlds.  Knowing the pros and cons of available tools. Understanding the components of a delivery pipeline. And strong communication skills to bridge once siloed teams into a cohesive and collaborative environment. More technical skills include, but aren’t limited to System administration – such as managing servers, database deployment, and system patching just to name a few.Experience with DevOps tools – understand the lifecycle from building and infrastructure to operating and monitoring a product or service.Configuration management – experience with configuration management tools such as Chef, Puppet, or Ansible to automate admin tasks.Continuous Integration (CI) and Continuous Deployment (CD) – this is a core practice of DevOps. It’s this role’s approach to software development with tools to automate the building, testing, and deploying of software processes. System architecture and provisioning – ability to design and manage computer ecosystems whether in-office or in the cloud. Within this skillset is the importance of Infrastructure as Code (IaC). This is an IT management process that applies best practices from software development to cloud infrastructure management.  Collaborative management skills – while the CI/CD skills are core to the technical side, this is one of the key components for the soft skills required for a DevOps structure. In a Nutshell DevOps (Development + Operations) is a practice that involves new management principles and requires a cultural change. And a DevOps Engineer is the heart of the transformation. Yet they can’t do it alone. A good DevOps Team has more than just one engineer. It involves a mix of generalists and specialists to implement and improve these practices within the software development cycle. A few of these roles include:  DevOps evangelist Automation expert Software developer Quality assurance  If you’re interested in Big Data and Analytics, Harnham may have a role for you. Check out our current vacancies or contact one of our expert consultants to learn more.  For our West Coast Team, contact us at (415) 614 - 4999 or send an email to sanfraninfo@harnham.com.  For our Mid-West and East Coast teams contact us at (212) 796-6070 or send an email to newyorkinfo@harnham.com.  

Amped Up Analytics: Google Analytics 4

Google Analytics 4 has amped up data insights into the behaviors and preferences of your customers. Where once each touchpoint only tracked what had been clicked, GA4 is bringing it all together in a more wholistic approach to the customer journey. As the fourth quarter of 2020 dawned, Google upped its game. Crafting a compelling array of features with machine learning at its core, this new platform offers a more customer-centric approach to data-driven insights, rather than split data across platforms and devices.   Though still in its infancy, there are some dramatic new changes afoot. And while it’s not a good idea to get rid of the old Universal Analytics platform before ringing in the new one, it is a good idea to understand what’s available now and what may come to be over time. Four Advantages to Google Analytics 4.0 From our desktop to our laptop to our smartphone, we carry our office in our pocket or on our lap. So, what better way to integrate what was once called “App + Web properties” into a more cohesive trackable measurement of data. Add to this the privacy protocols in place to protect customers, and Google Analytics 4 offers flexibility for future cookieless tracking and permissions, and advantages are revealed. Combined Data and Reporting Rather than focusing on one property (web or app) at a time, this platform allows marketers to track a customer’s journey more holistically.  The platform’s premise is that there is a pattern everyone follows. From the moment a customer visits your website to clicks on a button subscribing to your newsletter or blog – Acquisition and Engagement. To the moment your customer makes a purchase, is happy with the product or sevice, and comes back again – Monetization and Retention.  Designed for marketers who want to track users across multiple formats, Google Analytics 4 hopes to solve with Data Streams. These Data Streams merge to paint a picture of the customer journey from website visit to purchase. A Focus on Anonymized Data This anonymization answers the call to Data Privacy and third-party data collection. Crafting a unified user journey centered around machine learning to fill in any gaps, marketers and businesses have a way to get the information they need without diving into personal data issues. This is a key change in that Google is moving away from client-side focus and using server-side and customer-centric capabilities. With GDPR and privacy laws in full swing, marketers face enhanced privacy regulations as cookies are phased out or blocked. Predictive Metrics and Audiences Using Machine Learning to predict future transactions is a game changer for the platform. These predictive metrics for e-commerce sites on Google properties allow for targeted ads to visitors who seem most likely to make a purchase within one week of visiting the site.  Though focused on e-commerce sites now and based on transactions and revenue, there is an opportunity for marketers to identify and convert based on such leads as video views or form submissions. Machine Learning-Driven Insights The launch announcement for GA4 explains it “has machine learning at its core to automatically surface helpful insights and gives you a complete understanding of your customers across devices and platforms.” Machine Learning-driven insights include details that elude human analysts.  What These Changes Mean on the Digital Frontier We’re all reaching for higher value and Google Analytics 4.0 brings it into one unified platform for the future. As we make the shift from traditional Google Analytics to its 4.0 version, there is opportunity to get more creative.   Wondering if you should upgrade? This article breaks down the pros and cons to help you decide.  If you’re interested in Big Data & Analytics, Harnham may have a role for you. Check out our current vacancies or contact one of our expert consultants to learn more.  For our West Coast Team, contact us at (415) 614 - 4999 or send an email to sanfraninfo@harnham.com.  For our Mid-West and East Coast teams contact us at (212) 796-6070 or send an email to newyorkinfo@harnham.com.  

Making Sense of Unstructured Data with NLP

Natural Language Processing. It seems a simple enough explanation. The idea is to make computers sound like native speaking humans regardless of their language. Except there’s one problem. When we speak, we don’t follow our own rules of grammar. We use idioms, metaphors, abbreviations, and oftentimes use more body language to communicate than we realize.  So, what’s a poor machine to do when confronted with such an unstructured melee of data? Well, since semantics is not what you say it’s how you say it, we must teach computers to read between the lines. Of code. Enter NLP. The semantics of human language written for a machine to help make sense of our human behaviors gets organized. The Perfect Imperfections of Language Computers require structure. Natural language does not. Teaching machines how we communicate is no easy task, and yet we use machines that can do this every day. By combining technology and Machine Learning we begin to teach computers how to understand us. We teach them how to interpret and determine what it was we want done. When you’re asking Siri or Alexa a question, you’re helping them to learn how you ask, so they can better respond, and they make you more efficient. It’s a win-win for everyone. In business, using NLP techniques to drive business decisions is even more important. Now, the computer must decide what information is the most valuable to pull from a pile of Data. Understanding our choices, our tone, even the words we choose to use, helps our machines learn what we want to do or need done. Where is NLP Used? Since we use different rules when we speak than when we write, our computers learn how we talk and how to use language more naturally. Wondering where NLP might be used? In a word or two? Nearly everywhere. You are scheduling a meeting and when it’s time, a calendar reminder pops into your phone which says estimated drive time to the meeting based on traffic conditions in your area. Or you ask Alexa to play your favorite music list from Pandora.  Every touchpoint in this scenario is using NLP. We naturally might get into our car, ask our Virtual Assistant navigation system for directions, or to play our favorite music. Our choices don’t fit in a box and may not be logical, but the more we teach the machines, the closer they may get to understanding the nuances of our language. Here are 5 more ways we use NLP every day: Predictive text on your phone or in your Word document. Chatbots and Virtual Assistants to ensure customers are acknowledged in a timely manner, answer basic questions or redirect to appropriate personnel, and making suggestions to improve the customer experience.Curating social media feeds to determine customer needs and interest.Grammar correction software so our emails and business documents are error-free.Analyzing customer interactions using comments and reviews for customer feedback about a product or service. There’s a ton of information to be filtered, sorted, sifted, and analyzed, and NLP is just one of the tools Data Scientists use. Interested in the subfield of NLP? Check out this article for 6 techniques you need to know to get started. Already well-versed in the industry and looking for a new challenge? If you’re interested in Big Data and Analytics, Advanced Analytics, Life Sciences, Data Science, or any of our Data professional fields, we may have a role for you. Review our current vacancies or contact one of our expert consultants to learn more.   For our West Coast Team, contact us at (415) 614 - 4999 or send an email to sanfraninfo@harnham.com.   For our Mid-West and East Coast teams contact us at (212) 796-6070 or send an email to newyorkinfo@harnham.com.  

Smile: How Tech is Transforming the Dental Industry In 3D

Ever wondered what’s new at the dentist’s office? If you’re in the hot seat for dentures, crowns, or braces, you may be surprised at the speed you find yourself with a new smile.  Imagine a new set of teeth printed layer by layer before your eyes. Ok, before your dentist’s eyes. 3D printing has been used to print prosthetic limbs, orthopedic and cranial implants, surgical instruments, crowns, and dental restorations.  Electronic Health Records. AI-assisted surgeries. Machine Learning algorithms for more efficient workflows in hospitals and doctors’ offices. Medical technology isn’t new. But what about dental technology? In the Life Sciences field, technology is helping to shape the future of how we heal.  What is 3D Printing? According to the FDA, “3D printing is a process that creates a three-dimensional object by building successive layers of raw material. Each new layer is attached to the previous one until the object is complete. Objects are produced from a 3D file, such as computer-aided design (CAD) drawing or a Magnetic Resonance Image (MRI). The flexibility of this technology allows creation of individualized products such as prosthetics, dentures, or crowns specific to the individual requiring the device.  “It’s Not the Drill, It’s the Bill” Borrowed from an old commercial, the tagline originally implied patients weren’t afraid of the dentist, but of the bill at the end of the appointment. But with today’s technologies, particularly through the benefits of 3D printing, this tagline isn’t quite so dramatic. Here are a few ways, 3D printing in dentistry is benefitting both doctor and patient.  1. The Lab is Onsite Cost savings begin here. When the dentist can do his or her own lab work onsite, it’s less cost to consumers and to the dentist office’s bottom line. Add in the user-friendliness of the available 3D machines which allows dentists to produce molds, models, crowns, bridges, there’s plenty of opportunity to be more efficient and have more control over time and quality of the product.  3D Printers range in price from $20,000-$100,000+ for industrial printers. If you have a dental practice, you could most likely snag a desktop model for around $6,000 or less. Compare that to over $100,000 for outsourcing lab work, labor, and shipping costs included. 2. Getting it Right – More Accurate and Faster Services Reduce errors and increase accuracy when using 3D printing to convert digital images into physical objects within minutes. Watch as your patient’s dentures, for example, are printed layer-by-layer and usable with minutes, not hours or days.  Your technician can get to work as soon as the scan is ready and won’t be inhaling plaster or grinding dust while they work. A clean work space is a safe work space, no matter the industry. 3. Better Quality Products  Skilled dental technicians are still in high demand. But with the advent of 3D printing, their jobs are made a bit easier, and they’re able to design and create better quality products. Milled models could wear down over time. But a 3D model offers more stability and durability than its predecessor. Additionally, this digital model creates a more complex structure and offers a higher level of detail that may not be available in more traditional modeling techniques. 4. Enhanced Patient Experience 3D printing technologies have enhanced patient experience by reducing anxiety and increasing patient acceptance. How? Well, when you can print a model to help explain what’s going to be happening to identify and solve a patient’s problems, it can help alleviate their stresses of the unknown. Add to this a more efficient workflow, more aesthetically pleasing products, and less invasive treatments which make the patient’s visit go more smoothly, and you have a satisfied customer. 5. Save Money Last, but not least, is probably the biggest benefit to both patient and provider. Saving money. Though the upfront investment in a 3D can run into around $20,000 for a top model, it includes all the necessary components printer, reduces the need for skilled staff to produce dentures, implants, and other dental restorative models.  These savings are then passed on to the patient not only monetary value, but in time. The more accurate, efficiency, and speed of 3D printers means less time at the dentist’s office. Less return visits. Less error. With an estimated savings up to 80 percent depending on patient’s needs Smile. Tech is transforming the dental industry. Want to see where it can take you? If you’re interested in Big Data & Analytics, Advanced Analytics, Life Sciences, Data Science, or any of our Data professional fields, we may have a role for you. Check out our current vacancies or contact one of our expert consultants to learn more.  For our West Coast Team, contact us at (415) 614 - 4999 or send an email to sanfraninfo@harnham.com.  For our Mid-West and East Coast teams contact us at (212) 796-6070 or send an email to newyorkinfo@harnham.com.  

Recently Viewed jobs