Data Scientist - Machine Learning (NLP) - Inside IR35

City of London, London
£500 - £650 per day

Data Science contract available for a leading Pharmaceutical company, they are looking for an expert in Natural Language Processing (NLP) to come in and work as part of their Artificial Intelligence team.

The contract would be for 9 months initially and will be working as part of a major Data Science transformation programme. To be considered for this role you should have experience in the following areas:

  • Python
  • Machine Learning (Recent papers)
  • Natural Language Processing (NLP)
  • Knowledge Graphs
  • Neural Networks
  • Big-Data
  • Azure, Spark, Hadoop (Nice to have)

This contract would fall inside of IR35, it is a long term project working for a global leader in pharmaceuticals, they are looking for a data scientist who is experienced and able to hit the ground running, the day rate for this project would be between £500 - £600.day.

Send similar jobs by email
JAC19092020
City of London, London
£500 - £650 per day
  1. Contract
  2. Deep Learning and AI

Similar Jobs

Salary

£55000 - £70000 per annum

Location

London

Description

This is an exciting new opportunity for a Deep Learning & NLP Data Scientist to join a growing SaaS start-up in the AI space.

Salary

US$200000 - US$250000 per annum

Location

New York

Description

A global technology company worth over $10 billion is looking for a principal software engineer to work on ad-tech for their platform

Salary

US$100000 - US$120000 per annum

Location

Cincinnati, Ohio

Description

This is an opportunity for a pure Deep Learning Scientist to cut their teeth in a data-first fast paced environment in the world of consumer goods

Salary

£65000 - £70000 per annum

Location

London

Description

An award-winning AI start-up who are currently earning a lot of recognition for their work in the healthcare space.

Harnham blog & news

With over 10 years experience working solely in the Data & Analytics sector our consultants are able to offer detailed insights into the industry.

Visit our Blogs & News portal or check out our recent posts below.

From Broken Data Pipelines to Broken Data Headlines

This week's guest post is written by Moray Barclay. Two things have caused the UK’s Test & Trace application to lose 16,000 Covid-19 test results, both of which are close to my heart. The first is the application’s data pipeline, which is broken. The second is a lack of curiosity. The former does not necessarily mean that a data application will fail. But when compounded by the latter it is certain. Data Pipelines All data applications have several parts, including an interesting part (algorithms, recently in the news), a boring part (data wrangling, never in the news), a creative part (visualisation, often a backdrop to the news), and an enabling part (engineering, usually misunderstood by the news).  Data engineering, in addition to the design and implementation of the IT infrastructure common to all software applications, includes the design and implementation of the data pipeline. As its name suggests, a data pipeline is the mechanism by which data is entered at one end of a data application and flows through the application via various algorithms to emerge in a very different form at the other end. A well architected data application has a single pipeline from start to finish. This does not mean that there should be no human interaction with the data as it travels down the pipeline but it should be limited to actions which can do no harm. Human actions which do no harm include: pressing buttons to start running algorithms or other blocks of code, reading and querying data, and exporting data to do manual exploratory or forensic analysis within a data governance framework. The data pipeline for Test & Trace will look something like this:    a patient manually fills out a web-form, which automatically updates a patient listfor each test, the laboratory adds the test result for that patientthe lab sends an Excel file to Public Health England with the ID’s of positive patientsPHE manually transpose the data in the Excel file to the NHS Test & Trace systemthe NHS T&T system pushes each positive patient contact details to NHS T&T agentsfor each positive patient, an NHS T&T contact centre agent phones them. This is a not a single pipeline because in the middle a human being needs to open up an editable file and transpose it into another file. The pipeline is therefore broken, splitting at the point at which the second Excel file is manually created. If you put yourself in the shoes of the person receiving one of these Excel files, you can probably identify several ways in which this manual manipulation of data could lead to harm. And it is not just the data which needs to be moved manually from one side of the broken pipeline to the other side, it is the associated data types, and CSV files can easily lose data type information. This matters. You may have experienced importing or exporting data with an application which changes 06/10/20 to 10/06/20. Patient identifiers should be of data type text, even if they consist only of numbers, for future-proofing. Real numbers represented in exponential format should, obviously, be of a numeric data type. And so on. One final point: the different versions of Excel (between the Pillar 2 laboratories and PHE) are a side-show, because otherwise this implies that had the versions been the same, then everything would be fine. This is wrong. The BBC have today reported that “To handle the problem, PHE is now breaking down the test result data into smaller batches to create a larger number of Excel templates. That should ensure none hit their cap.” This solves the specific Excel incompatibility problem (assuming the process of creating small batches is error-free) but has no bearing on the more fundamental problem of the broken data pipeline, which will stay until the manual Excel manipulation is replaced by a normal and not particularly complex automated process. Curiosity So where does curiosity fit in? The first thing that any Data Analyst does when they receive data is to look at it. This is partly a technical activity, but it is also a question of judgement and it requires an element of curiosity. Does this data look right? What is the range between the earliest and the latest dates? If I graph one measurement over time (in this case positive tests over time), does the line look right? If I graph two variables (such as Day Of Week versus positive tests) what does the scatter chart look like? Better still, if I apply regression analysis to the scatter chart what is the relationship between the two variables and within what bounds of confidence? How does that relate to the forecast? Why? This is not about skills. If I receive raw data in csv format I would open it in a python environment or an SQL database. But anyone given the freedom to use their curiosity can open a csv file in Notepad and see there are actually one million rows of data and not 65,000. Anyone given the freedom to use their curiosity can graph data in Excel to see whether it has strange blips. Anyone given the freedom to use their curiosity can drill down into anomalies. Had those receiving the data from the Pillar 2 laboratories been allowed to focus some of their curiosity at what they were receiving they would have spotted pretty quickly that the 16,000 patient results were missing. As it was, I suspect they were not given that freedom: I suspect they were told to transpose as much data as they could as quickly as possible, for what could possibly go wrong? Single Data Pipeline, Singular Curiosity: Pick At Least One To reiterate, the current problems with T&T would never have arisen with a single data pipeline which excluded any manual manipulation in Excel. But knowing that the data pipeline was broken and manual manipulation was by design part of the solution, the only way to minimise the risk was to encourage people engaged in that manual process to engage their curiosity about the efficacy of the data they were manipulating. In their prototype phases – for that is the status of the T&T application - data projects will sometimes go wrong. But they are much more likely to go wrong if the people involved, at all levels, do not have enough time or freedom to think, to engage their curiosity, and to ask themselves “is this definitely right?” You can view Moray's original article here.  Moray Barclay is an Experienced Data Analyst working in hands-on coding, Big Data analytics, cloud computing and consulting.

Is Computer Vision at the Core of the New Normal?

Computer Vision is one of the fastest growing markets in Data & Analytics. While it was on a trajectory prior to the pandemic, the needs we have now have amped up the role Computer Vision plays in our day-to-day lives and businesses who want to keep up or get ahead are paying attention.  Unexpected Businesses Using Computer Vision Some unusual players leaning on these technologies are grocery stores. While some have pivoted to pickup and delivery, others have remained stagnant with yesterday’s shopping habits changed only to individuals in store wearing masks. For those who made the leap to the "new normal", they’re using things like shelf sensors and Machine Learning to automate ordering and determine best placement of a product. Though retail stores are no stranger to video analytics, the rise of Deep Learning and AI offer a more rapid analysis of video for real-time threat assessment. Teaching the machine to watch for crowding, erratic movement, or potential conflict allows for quick reaction or proactive measures to stop a conflict in play. Yet, behind all this Machine Learning and Computer Vision elements are people. Real live humans. And it’s their new normal which is a strong part of the world’s new normal as most everyone shifts and remains online, working remotely. Behaviours are changing and many businesses have differentiated themselves from others by staying ahead of the game.        Five Ways Businesses Are Moving Forward in the New Normal Remote work is here to stay. A jump of 18% of remote working after the pandemic is expected to remain key to many businesses. And nearly three quarters of executives, plan to increase their remote workers. Key components of this new change will be to bring onboard those with strong digital collaboration skills, ability to manage virtually, and reassess how goals and objectives are to be decided. How will businesses keep remote employees engaged, enthused, and feel part of the team when they could be miles or countries apart?Gig Workers as Cost-Saving Measure. As employees move out of office and online, gig workers are a go-to for businesses hoping to move forward and keep costs low. Performance management systems will need to be re-evaluated. After all, if the idea is to keep costs low (read: overhead), then how does the debate about whether or not to offer benefits fit in to the mix?Definitions are Changing. Whether the definition includes ‘critical skills,’ ‘critical role,’ or something similar. What these meant once are changing. Now, the focus is on how to encourage, mentor, or coach employees in professional development skills which can open up a variety of opportunities versus one set path to one set role.Keeping Track Virtually. Though most businesses tend to follow the model of ‘productivity and performance’ over simply hours worked, some organisations passively track their remote workforce. This keeping track can include timeclock software virtually managed to computer usage to monitoring communications. Several benefits of data tracking in this manner could be a boon to HR Managers as it could help to understand employee engagement. But it’s a fine line to traverse.Organisational Redesign Done with Efficiency in Mind. As everything from products to people move online, it’s more important than ever to ensure things like logistics, supply chains, and workflows are designed with efficiency in mind. Computer Vision AI models can help take these systems to the next level as things like grocery shopping, retail, and legacy businesses find their business must go online or pivot in the new normal to survive. In our recently released 2020 Salary Guide we discuss each specialism. What’s working. What isn’t. And how businesses can hire and retain top talent to keep their projects on track and their businesses running smoothly.If you’re interested in Data & Technology, Risk or Digital Analytics, Life Sciences Analytics, Marketing & Insight, or Data Science, check out our current opportunities. Alternatively, you can contact one of our expert consultants if you’d like to learn more. 

Recently Viewed jobs