Umar Butler
umarbutler
50 followers · 32 following
https://umarbutler.com/
AI & ML interests
Law, technology, AI and everything in between.
Recent Activity
posted an update 1 day ago
What happens when you annotate, extract, and disambiguate every entity mentioned in the longest U.S. Supreme Court decision in history? What if you then linked those entities to each other and visualized it as a network? This is the result of enriching all 241 pages and 111,267 words of Dred Scott v. Sandford (1857) with Kanon 2 Enricher in less than ten seconds at the cost of 47 cents.

Dred Scott v. Sandford is the longest U.S. Supreme Court decision by far, and has variously been called "the worst Supreme Court decision ever" and "the Court's greatest self-inflicted wound" due to its denial of the rights of African Americans. Thanks to Kanon 2 Enricher, we now also know that the case contains 950 numbered paragraphs, 6 footnotes, 178 people mentioned 1,340 times, 99 locations mentioned 1,294 times, and 298 external documents referenced 940 times.

For an American case, there are a decent number of references to British precedents (27 to be exact), including the Magna Carta (¶ 928). Surprisingly though, the Magna Carta is not the oldest citation referenced. That would be the Institutes of Justinian (¶ 315), dated around 533 CE. The oldest city mentioned is Rome (founded 753 BCE) (¶ 311), the oldest person is Justinian (born 527 CE) (¶ 314), and the oldest year referenced is 1371, when 'Charles V of France exempted all the inhabitants of Paris from serfdom' (¶ 370).

All this information and more was extracted in 9 seconds. That's how powerful Kanon 2 Enricher, my latest LLM for document enrichment and hierarchical graphitization, is. If you'd like to play with it yourself now that it's available in closed beta, you can apply to the Isaacus Beta Program here: https://isaacus.com/beta.
new activity 2 months ago
answerdotai/ModernBERT-base: Is this model meant for full bfloat16, AMP bfloat16 or no bfloat16?
updated a collection 4 months ago
Open Legal Data
umarbutler's activity
New activity in
answerdotai/ModernBERT-base
2 months ago
Is this model meant for full bfloat16, AMP bfloat16 or no bfloat16?
👍
3
4
#7 opened about 1 year ago by
umarbutler
commented
3 papers
4 months ago
The Massive Legal Embedding Benchmark (MLEB)
Paper
•
2510.19365
•
Published
Oct 22, 2025
•
18
•
5
New activity in
Prarabdha/indian-legal-supervised-fine-tuning-data
6 months ago
License
#2 opened 6 months ago by
umarbutler
New activity in
nguha/legalbench
6 months ago
LegalBench no longer loads on the latest version of datasets
2
#33 opened 6 months ago by
umarbutler
New activity in
allenai/gooaq
7 months ago
Many answers are stored as literal string representations of arrays
#4 opened 7 months ago by
umarbutler
New activity in
jhu-clsp/CLERC
7 months ago
License?
#7 opened 7 months ago by
umarbutler
New activity in
answerdotai/ModernBERT-large-training-checkpoints
8 months ago
Last final stable checkpoint
2
#1 opened 8 months ago by
umarbutler
New activity in
pietrolesci/nli_fever
over 1 year ago
Premise and hypothesis wrong way around?
2
#2 opened almost 2 years ago by
MoritzLaurer
New activity in
nguha/legalbench
over 1 year ago
Significant train/test imbalance makes this more tailored to GenAI rather than LLMs in general
3
#31 opened over 1 year ago by
umarbutler
New activity in
Xenova/gpt-4
over 1 year ago
Conversion to tiktoken
3
#4 opened over 1 year ago by
koyfman
New activity in
isaacus/open-australian-legal-embeddings
over 1 year ago
Dataset Viewer issue
1
#3 opened about 2 years ago by
umarbutler
New activity in
isaacus/open-australian-legal-qa
over 1 year ago
Fix typo in the dataset name
1
#20 opened over 1 year ago by
davebulaval
New activity in
umarbutler/better-cuad
over 1 year ago
[bot] Conversion to Parquet
#1 opened over 1 year ago by
parquet-converter
New activity in
isaacus/open-australian-legal-corpus
over 1 year ago
Releasing v5.0.0.
#4 opened over 1 year ago by
umarbutler
New activity in
isaacus/open-australian-legal-corpus
almost 2 years ago
BuilderConfig 'train' not found
1
#3 opened almost 2 years ago by
skoota
New activity in
yunconglong/Truthful_DPO_TomGrc_FusionNet_7Bx2_MoE_13B
almost 2 years ago
any contamination results?
2
#4 opened about 2 years ago by
wukongai
New activity in
isaacus/open-australian-legal-corpus
almost 2 years ago
Victoria?
4
#2 opened over 2 years ago by
lifedigital
New activity in
TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ
about 2 years ago
always getting 0 in output
👍
4
15
#3 opened about 2 years ago by
xubuild