Displaying 14 resources
White paper White paper

A General Language Assistant as a Laboratory for Alignment

Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human values, meaning that it is helpful, honest, and harmless.

Category
Trustworthiness
Source
Adra-e
White paper White paper

Ensuring the Safety of Artificial Intelligence

The essays are some of our first steps towards an understanding of how to make today’s choices in ways that take the people of tomorrow seriously. This is not an easy undertaking.

Category
Legal framework and regulation, Recommendations towards policy changes, Best practices in deployment
Target audience
ADR Experts and Associations, Individual Citizens/Members of the Society, Policy Makers, Private Sector, Public Sector, Researchers and Academic
Source
Adra-e