Data Scientist

Hertfordshire
£500 - £600 per day

SENIOR DATA SCIENTIST
£500- £550 PER DAY
6 MONTHS
REMOTE BASED

A chance to work alongside other Data Scientists in building different price prediction models using Neural networks.

THE COMPANY:

As a Data Scientist you will be working for a start up who sell an innovative product within the renewable space. You will work in an agile team that has grown massively in the last few years form 3 to 15 people. As a Data Scientist you will be helping to leverage that AI capabilities through complex Machine Learning models. You will be working alongside both Data Scientists and Data Engineers.

THE ROLE:

As a Data Scientist you will be using Python extensively to help build ML models. As a Data Scientist you will be helping to improve their supply chain by helping to build an anomaly detector. As a Data Scientist you must have experience using Neural Networks, Tenser Flow or Keras as you will be using these tools to build complex ML models. As a Data Scientist you will be dealing with Big Data so any experience with tools such as Spark is useful. As a Data Scientist you will be expected to present your ideas to other members of the wider product team so you must be able to demonstrate you are an effective communicator. They work within AWS so any experience with tools such as Docker for deployment is also highly valuable.

YOUR SKILLS AND EXPERIENCE:

The successful Data Scientist will have the following skills and experience:

  • Extensive experience with Python and a good understanding of its libraries; Pandas, NumPy, SciPy
  • Commercial experience working on different customisation and recommendation models
  • Used Pyspark for ETL development and model building
  • Good commercial use of Neural Networks
  • PHD in Computer Science or equivalent

The ideal candidate will have

  • worked in AWS
  • Familiarity with Docker

THE BENEFITS:

  • A competitive day rate of up to £600

HOW TO APPLY:
Please register your interest by sending your CV to Anna Greenhill via the Apply link on this page.

Send similar jobs by email
49489/AG
Hertfordshire
£500 - £600 per day
  1. Contract
  2. Data science

Similar Jobs

Salary

€30000 - €39000 per annum + competitive benefits package

Location

Vilnius, Vilnius County

Description

An exciting role for a Data Analyst with advanced coding skills, interested in moving into a more Data Science focussed role.

Salary

€75000 - €85000 per annum + BENEFITS

Location

Düsseldorf, Nordrhein-Westfalen

Description

One of the leading German life-science companies is looking for a tech/data science-enthusiast who loves to innovate and work on high impact projects.

Salary

£70000 - £80000 per annum + 30% bonus and flexible working

Location

Milton Keynes, Buckinghamshire

Description

Are you interested in building to a fast emerging, forward-thinking,data-driven team?Do you have an interest in improving customer experience worldwide?

Salary

£70000 - £80000 per annum + 30% bonus and flexible working

Location

Milton Keynes, Buckinghamshire

Description

Are you interested in building to a fast emerging, forward-thinking,data-driven team?Do you have an interest in improving customer experience worldwide?

Harnham blog & news

With over 10 years experience working solely in the Data & Analytics sector our consultants are able to offer detailed insights into the industry.

Visit our Blogs & News portal or check out our recent posts below.

From Broken Data Pipelines to Broken Data Headlines

This week's guest post is written by Moray Barclay. Two things have caused the UK’s Test & Trace application to lose 16,000 Covid-19 test results, both of which are close to my heart. The first is the application’s data pipeline, which is broken. The second is a lack of curiosity. The former does not necessarily mean that a data application will fail. But when compounded by the latter it is certain. Data Pipelines All data applications have several parts, including an interesting part (algorithms, recently in the news), a boring part (data wrangling, never in the news), a creative part (visualisation, often a backdrop to the news), and an enabling part (engineering, usually misunderstood by the news).  Data engineering, in addition to the design and implementation of the IT infrastructure common to all software applications, includes the design and implementation of the data pipeline. As its name suggests, a data pipeline is the mechanism by which data is entered at one end of a data application and flows through the application via various algorithms to emerge in a very different form at the other end. A well architected data application has a single pipeline from start to finish. This does not mean that there should be no human interaction with the data as it travels down the pipeline but it should be limited to actions which can do no harm. Human actions which do no harm include: pressing buttons to start running algorithms or other blocks of code, reading and querying data, and exporting data to do manual exploratory or forensic analysis within a data governance framework. The data pipeline for Test & Trace will look something like this:    a patient manually fills out a web-form, which automatically updates a patient listfor each test, the laboratory adds the test result for that patientthe lab sends an Excel file to Public Health England with the ID’s of positive patientsPHE manually transpose the data in the Excel file to the NHS Test & Trace systemthe NHS T&T system pushes each positive patient contact details to NHS T&T agentsfor each positive patient, an NHS T&T contact centre agent phones them. This is a not a single pipeline because in the middle a human being needs to open up an editable file and transpose it into another file. The pipeline is therefore broken, splitting at the point at which the second Excel file is manually created. If you put yourself in the shoes of the person receiving one of these Excel files, you can probably identify several ways in which this manual manipulation of data could lead to harm. And it is not just the data which needs to be moved manually from one side of the broken pipeline to the other side, it is the associated data types, and CSV files can easily lose data type information. This matters. You may have experienced importing or exporting data with an application which changes 06/10/20 to 10/06/20. Patient identifiers should be of data type text, even if they consist only of numbers, for future-proofing. Real numbers represented in exponential format should, obviously, be of a numeric data type. And so on. One final point: the different versions of Excel (between the Pillar 2 laboratories and PHE) are a side-show, because otherwise this implies that had the versions been the same, then everything would be fine. This is wrong. The BBC have today reported that “To handle the problem, PHE is now breaking down the test result data into smaller batches to create a larger number of Excel templates. That should ensure none hit their cap.” This solves the specific Excel incompatibility problem (assuming the process of creating small batches is error-free) but has no bearing on the more fundamental problem of the broken data pipeline, which will stay until the manual Excel manipulation is replaced by a normal and not particularly complex automated process. Curiosity So where does curiosity fit in? The first thing that any Data Analyst does when they receive data is to look at it. This is partly a technical activity, but it is also a question of judgement and it requires an element of curiosity. Does this data look right? What is the range between the earliest and the latest dates? If I graph one measurement over time (in this case positive tests over time), does the line look right? If I graph two variables (such as Day Of Week versus positive tests) what does the scatter chart look like? Better still, if I apply regression analysis to the scatter chart what is the relationship between the two variables and within what bounds of confidence? How does that relate to the forecast? Why? This is not about skills. If I receive raw data in csv format I would open it in a python environment or an SQL database. But anyone given the freedom to use their curiosity can open a csv file in Notepad and see there are actually one million rows of data and not 65,000. Anyone given the freedom to use their curiosity can graph data in Excel to see whether it has strange blips. Anyone given the freedom to use their curiosity can drill down into anomalies. Had those receiving the data from the Pillar 2 laboratories been allowed to focus some of their curiosity at what they were receiving they would have spotted pretty quickly that the 16,000 patient results were missing. As it was, I suspect they were not given that freedom: I suspect they were told to transpose as much data as they could as quickly as possible, for what could possibly go wrong? Single Data Pipeline, Singular Curiosity: Pick At Least One To reiterate, the current problems with T&T would never have arisen with a single data pipeline which excluded any manual manipulation in Excel. But knowing that the data pipeline was broken and manual manipulation was by design part of the solution, the only way to minimise the risk was to encourage people engaged in that manual process to engage their curiosity about the efficacy of the data they were manipulating. In their prototype phases – for that is the status of the T&T application - data projects will sometimes go wrong. But they are much more likely to go wrong if the people involved, at all levels, do not have enough time or freedom to think, to engage their curiosity, and to ask themselves “is this definitely right?” You can view Moray's original article here.  Moray Barclay is an Experienced Data Analyst working in hands-on coding, Big Data analytics, cloud computing and consulting.

Is Computer Vision at the Core of the New Normal?

Computer Vision is one of the fastest growing markets in Data & Analytics. While it was on a trajectory prior to the pandemic, the needs we have now have amped up the role Computer Vision plays in our day-to-day lives and businesses who want to keep up or get ahead are paying attention.  Unexpected Businesses Using Computer Vision Some unusual players leaning on these technologies are grocery stores. While some have pivoted to pickup and delivery, others have remained stagnant with yesterday’s shopping habits changed only to individuals in store wearing masks. For those who made the leap to the "new normal", they’re using things like shelf sensors and Machine Learning to automate ordering and determine best placement of a product. Though retail stores are no stranger to video analytics, the rise of Deep Learning and AI offer a more rapid analysis of video for real-time threat assessment. Teaching the machine to watch for crowding, erratic movement, or potential conflict allows for quick reaction or proactive measures to stop a conflict in play. Yet, behind all this Machine Learning and Computer Vision elements are people. Real live humans. And it’s their new normal which is a strong part of the world’s new normal as most everyone shifts and remains online, working remotely. Behaviours are changing and many businesses have differentiated themselves from others by staying ahead of the game.        Five Ways Businesses Are Moving Forward in the New Normal Remote work is here to stay. A jump of 18% of remote working after the pandemic is expected to remain key to many businesses. And nearly three quarters of executives, plan to increase their remote workers. Key components of this new change will be to bring onboard those with strong digital collaboration skills, ability to manage virtually, and reassess how goals and objectives are to be decided. How will businesses keep remote employees engaged, enthused, and feel part of the team when they could be miles or countries apart?Gig Workers as Cost-Saving Measure. As employees move out of office and online, gig workers are a go-to for businesses hoping to move forward and keep costs low. Performance management systems will need to be re-evaluated. After all, if the idea is to keep costs low (read: overhead), then how does the debate about whether or not to offer benefits fit in to the mix?Definitions are Changing. Whether the definition includes ‘critical skills,’ ‘critical role,’ or something similar. What these meant once are changing. Now, the focus is on how to encourage, mentor, or coach employees in professional development skills which can open up a variety of opportunities versus one set path to one set role.Keeping Track Virtually. Though most businesses tend to follow the model of ‘productivity and performance’ over simply hours worked, some organisations passively track their remote workforce. This keeping track can include timeclock software virtually managed to computer usage to monitoring communications. Several benefits of data tracking in this manner could be a boon to HR Managers as it could help to understand employee engagement. But it’s a fine line to traverse.Organisational Redesign Done with Efficiency in Mind. As everything from products to people move online, it’s more important than ever to ensure things like logistics, supply chains, and workflows are designed with efficiency in mind. Computer Vision AI models can help take these systems to the next level as things like grocery shopping, retail, and legacy businesses find their business must go online or pivot in the new normal to survive. In our recently released 2020 Salary Guide we discuss each specialism. What’s working. What isn’t. And how businesses can hire and retain top talent to keep their projects on track and their businesses running smoothly.If you’re interested in Data & Technology, Risk or Digital Analytics, Life Sciences Analytics, Marketing & Insight, or Data Science, check out our current opportunities. Alternatively, you can contact one of our expert consultants if you’d like to learn more. 

Recently Viewed jobs